FIGURE 1A: Polynucleotide Sequence of Phage Gamma (γ) (SEQ ID NO: 1)

CTCAACTTCGCAGAAAAATCCGTTTTTGCATATTTTTTTAAGGGGGTGTAATCATGGCTGGAAGA^ATAAACAACCACTC 

TCTGTTATACAGGGAAAAGGTAGATCAAATCACATTACAAAAAGTGAGAAAAACAGACGAGAAAAACAAGAAGAAGCATT
GCGGGGGCATACTGATAAAATTGAAGCTCCTTCTTATTTGACTGCAGCACAAAAAAAGGAATTCGATACTTTAGCTGCTG 
AATTAGTCAGATTGAAAATTTTCAGTAACTTAGATGTTGACAGTTTAGCAAGGTACGTTGATTCTAAAGACCAATATATA 
AAAATGGTTCGTCTGCTAAGAAAAACAAAACCTTCAGATGACTTTAAATTGTAT^ 

TTTATTCAATGAATGCCGTTCTTCAGCTAGTGATTTAGGTTTGACCATTACATCCCGCTTAAAATTAGTTATTCCAGAAG 
TAGATACTTCACAACAAAAGCAAAGTGAAGCGCAAAAGCGTTTTGGTGATCGTATATGAACTGGATAATGGAACGGGTTT 
TTGCATATTGCGAGGACATTTTAAACGGCAAGATAAATAGTTGTAAAAAACATCGTTGGGCCATTGAACGATTTATAAGG 
GATTATGAGGAGTGTCAAAGTGAAGACAGTCCTTTTTATTTTGATGGAGAGATAGCGGAGGATTTTTACTGGTTTGCAAA 
GGAATTTAAGCACGTTGAAGGGATTTTGGCAGGTGAATCCGTAGAATTAACTGATTTTCAATTGTTTCTAGCGGCTAATA
TTTTCGGATTCAAAAAGAAAATAAATGGAGCAAGGCGATTTAGAAAGGTTTTTATTCAGTTAGCGCGTAAAAATGCTAAA 
TCTCAGTTTCTTGCTATTGTAGCAGCTTTTTGTACATTTCTTGGAGACGAAAAACAACGGGCTTATATTGCTGGATGGAC 
AAGAGACCAATCATCTGAAGTTTATGAAGCTGTAAAAACAGGGATTAGTTCTAGTGAATTGTTAGAAGGTAAATGGAAAG 
AGGCTTATAGTACCATTGAAATATTTAAGAATGGTTCAGTTGTCGTTCCACTTTCAAAAGAAGCTAGAAAAACTGGTGAT 

GGTAAAAACCCGTCTCTTGGAATTGTCGATGAATATCATGCACATGAAACTGATGAAATTTATGACGTTTTATCGTCTGG
TATGGTGGCAAGGAAAGAGCCGTTAATGTTTATCATAACAACAGCTGGTTTCGACTTATCAAGACCTTGTTATAGAGAGT 

AT GAGTATGTCAGTGACATCTTAGACC CGT CAAAAAAT GTAGAAAACGAT GATTATTTCGTTAT GAT CT GT GAA.TTGGAA 
AAGAACGATGATATCAAAGATGAGTCGAATTGGATAAAAGCAAACCCAATCGTAGCTACATATGAAGAAGGTTTGGAAGG 
TATACGTTCAGATTTGAAGGTTGCTCTTGATAGACCTGAAAAGATGAGGGCTTTTTTAACCAAAAACATGAATATTTGGG 
T C GAT AAAAAG GACAACG GAT ACAT GGAT AT GT CAAAAT GGC AAAAAT GCGAAGT AGAT ACCTTT GATTT TT C AGGT GC G 
ACT CTTT GGATAG GT GGCGACTTAT CAAT GACAACAGATTTAACTAGT GT CGGTTGGGTT GGAAT GGAC GATGAAGGT GA 
TT TT ATT GTT GGACAACATT CATTTAT GCCTGAAGCACGTTTGAAAGAAAAGAT GG C CAT AGATAAG GT GC GTT ATGATT 
TATGGGCCGAACAAGGGTATTTAACTTTAACGCCTGGTGAAATGGTTGATTATACAATTGTTGAGTCTTGGATAGAAAAC 
TTTTCAAAAGACAAAGAAATTCAAGAGTTTGATTACGATAAATGGAATGCGTTACATCTAGCACAAAATTTAGAGAATAA 
AG G GTTC GTT T GT G TAGAAAT C C CTCAAAGGATTGCTAATT TAT CC ATT CCGACTAAAAATTTTCGAGAAT^AAGTATACG 

AAAAGAAAGTT AAACATAAT GGAGAT C CAGT CCTTTTTTGGGCG C TTAATAAT GCT GTT GTTAAAAT G GAT GAT CAG GAA 
AACAT TAT GATTT C GAAAAAAATAAGTAAAAAT C GT ATT GAT C CAGCAG CAG C G GTCTTAAAT G C AT TT T C T AGG G CT AT 
GTATGGAGCAAGTGTCAGGTTTGATGTATCTGAATTTGCAAATAAAGACTTTCTAGGCAAGTTATGGAACTAGGGAGGGG 

GT GAACAT GTGAAGAT AGTGGATTCT GTTAAAAAGTTCTTTAATTTTGAAAAACGCCAAACGT CGCAGGT AAT AGAGTT G 
AAT AAAGACGAT GAAAAATTATTAGAATGGCTAGG GATTT CTCCAAGTACTATTAGCGTTAAAGGAAAAAAT GCTTT AAA 
AGTT GCTACAGT CTTT GCTTGTAT CAAAAT ACT AT CT GAAT CCGTAT CAAAGTTACCGTT GAAAATTTAT CAGGAAGAT G 
AATATGGAAT CCAAC GCGGTACAAAGCATTATCT CAACAATTTACTGAGACTAAGGCCTAACC CGTATATGT C CAGTAT G 
AACTTTTTC G GAT CATTAGAAGCT CAAAAAAATTTATAT GG CAAT A GCT AC GCTAACATAGAGTTT GATAGAAAAG GTAA 
AGT C CAAG C GTT AT G G C C GATAGAT G C TT CTAAAGT GACAGTATACATT GATGAC GTT G GTTT AT TAAATT C C AAAACTA 
AAAT GT GGTAT GTAGT AAATACG G GT G GACAACAAAGAGT GT TAAAG CCAGAAGAGATACT GCACTT T AAAAAC G GAATA 
ACTCTTGATGGTCTTGTCGGTGTTCCTACAATGGAATATTTAAAGTCTACATTAGAAAATTCAGCTTCAGCTGATAAATT 
CAT AAAT AATTTTT ACAAACAAGGGTT ACAGGTAAAGG GATTAGTTCAATATGTCGGT GATTTAAAT GAAGAT GCGAAA& 
AG GTTTT CCG AGA-AAAT TT CG AAT CAAT G TCT AGC GGT CT T CAAAAT AG C CA.TCGT ATT G CAT TAAT GCC AG TAG GAT AT 
CAATT T C AAC CT ATT T CATTAAAT AT GT CAGAT GCT C AATTT CT CGAAAAT ACCGAACTTACT ATT AG G CAAAT C G CT AC 
T G C ATT C G GCATT AAAAT G CAT C AAT T AAAT GAT T T GAGT AAAGC GACTTT AAATAATAT T GAG CAG CAG CAACAACAAT 
T CT ATACC GAT AC ATT ACAAGC GACTTT AACAAT GTAT GAG CAAGAAAT GAC GTAT AAG CT ATTTTT AGACAGT GAGTT G 
GAT AAG G GGT TT TATT CAAAATT C AAT GTAGAC G CTATT TTAAGAG C G GAT AT CAAAAC GAGATAT GAAGCTT AC AGAAC 
GGGTATTCAAGGCGGTTTCCTTAAACCTAACGAAGCTAGAAGTAAAGAAGATTTTACCACCAGAAGCTGGTGGGGATCGTT 

TAC TT GT TAAT GGAAATAT GTT GCC GATT GATAT G G C T G GACAG G CAT ATT T GAAG GGAG GT GAT AC TAAT G GAGAAGT C 
AG C AAAGAAGG AAAT GAAG GAAAT TAGAG CTT T G C CAAT GACTAT T GAAGT C CGT G AAGTT AAT GAG GAC GAG G GAAAAC 
GAACAATTT C G G GAT C GAT AAAAT ATAACAAT GAAAGT G C C GAAAT G C GT GACT G GT G G G G CGATAC TTT C GT AGAAGAG 
^TTGCTGAGGGAGCTTTTGATGAAAGTTTAAAAGTTCGTGATGTTGTAGGTTTATGGTCTCACGACACATCTCAAGTATT 

AG GAAAT ACTAAAAGTAAAACTTT AC GAATC GAAAAT G ACAAGAAAGAAT T ACGATTTG AATT AGAT ATT CCT AATAC AA 
CT GTT GG GAAT GAC GC AT GGGAATTAATTAAGCGT GGAGAT GTT GAT GGAGTTTCTTTT GGGAT GAAG GTT AC AAAAG AC 
AAATGGT CAT C GGAAGAAC GT GAAAAT GGAAAG CTTTATAAGC GTT CGATTTTAAAT GCT GAACT ATAT GAAAT AT CACC 
GGTTGCATTCCCTGCATATCCAACGAATGAAGTAAGTGTACGTTCATTGGATGATTTTAAAGCTGGAGAAAAGCGAGTAG 
CTGATGAGTTTAGGAAAAGAAAACTACAAATCGAACTAGAGCTTATATAAGGCTCTTTTTTTATTGATAAATTTAAGGAG 
T GATTT GAAT GTCAAAAGAATTACGTG AATT ATTAGCTAAGTTAG AAG GG AAAAAG GAA GAAGTAC GCT CTCTTATGGGA 
GAAGATAAAGTGGCAGAAGCAGAACAAATGATGGAAGAAGTGCGATCACTTCAGAAAAAAATTGATTTACAACGCTCATT 

AGAT GAAG C AGAAAC GGAAGAAC GAAAT AAT G GAAGAGAA GTT GAAACACG TAAT GT AGAT G GT GAAAT G GAAT AC C G C G 
ATGTGTTTATGAAAGCATTACGCAATAAACCATTAAATGCTGAAGAACGTGAATTTCTTGAGGATGATTTAGAACAACGT

GCCATGTCAGGATTAACTGGGGAAGATGGAGGACTTGTCATCCCTCAAGATATTCAAACGCAAATCAATGAATTAGCTCG 

TTCATTTGATGCGCTTGAGCAATATGTAACTGTTGAACCAGTGCGTACACGTTCAGGATCACGAGTATTAGAGAAAAATT 

CAGATATGATTCCGTTTGCTGAAATCACTGAAATGGGTGAAATTCCAGAAACTGATAATCCGAAATTTTCAAATGTACAA 

TATGCAGTGAAGGACAGAGCAGGTATTTTACCGTTATCTCGTTCATTACTTCAAGATAGTGATCAAAACATCCTAAAGTA 

TGTGACTAAATGGCTAGGTAAG7\AATCTAAAGTTACACGTAATGTGTTAATCTTGGGCGTAATTGAAAAGTTAACAAAAC 

AAGCAATCAAATCTCTGGATGATATTAAAGATGTATTAAATGTTAAATTAGACCCAGCGATTTCTCCGAATGCGATTTTA 

CTT AC AAAC C AAGAT G GAT TTAATT AT TTAGACAAATT AAAAGAT AAAGAC G G AAAAT AT ATTTT AC AGTC AG AT CCAAC 

GCAAAAAAACAAAAAACTATTTGCTGGTACTAATCCAGTCGTTGTTGTTTCGAATCGTTTCTTAA7VATCAAAGGGAACTA 

CAGCTAAAAAAGCGCCACTTATTATTGGTGATTTAAAAGAAGCTATTGTTTTATTTAAACGTGAAGATATGGAACTGGCT 

TCTACAGATGTAGGTGGTAAAGCATTCACTCGTAATACATTAGATTTACGCGCAATTCAACGTGATGATGTGCAAATGTG 

GGATAATGAAGCAGCAGTTTACGGAGAAATCGATTTAAGCGCTCCTGTTGAACAACCTCAAGGGTAAACTAAGGAGGCAT 

TTGAATGCTTGTTACCTTAGAAGAAGCTAAAGAATGGATTCGAGTGGACGGAGACGATGACCCAACTATCACTATGTTAA 

TTAAAGCGGCTGAATTATATATTTACAAAGCAACTGGCAAAACATTTACTCAAACAAATGAAGATGCTAAGTTGCTTTGT 

TTATTTCTGGTGGCTGATTGGTACGGAAATCGACTACTTGTAGGTGAAAAAGCCAGTGAAAAAATCAGAACCATTGTTCA 

G AGT AT GAT ATTA CAGCTCCAATAT G CTT CAGAG C CT CAGGAG GAAAGAAAAT GAAT C C T G CAAAATT AG ATAAAC G G CT 

TA.CATTT CAAGTAAAAGATGAAAAT GCAAAAGGGCCT GACGGT GATCCGAT AGAT GGAT AT AAAGAT GCTTTTACC GTAT 

GGGGCTCTTTTGTTTATTTAAAGGGAAGGAAATACTTTGAGGCAGCAGCTGCTAATAGTGAGGTTCAAGGAGAAACAGAA 

AT C AGAAAT CG G GAT GATGTAAGT G CAGATAT GAAAATTAAGT ACAAAAAC GT GATTTAT GAT ATT GTTT C C GTT AT T CC 

AACT CAAGAT CATACTTTATTAAT CAT GT GGAAAC GT GGTGAAAT GAAT G G CT GAT G GTAT AGAT TT AG AT TT ATT AG GA 

TTTGATCGTTTAGTTACTGAATTAGACCAAATGGGGTTACGGGGAGAGAAAATTGAAGATAAAGCTCTTGCAGCTGGTGG 

TGAACCTATTCGTAAAGCCATTGCAGAACGAGCGCCAAG7\AGCCC7\AGCCCCAAAAAACGATCTAAAAGTGAACCGTGGC 

GTACAGGGCAACATGGTGCAGACCAGATAAAAGTAACAAAAGCTAAACTTGAAGGTGGAATAAAAACAGTAAAAATAGGT 

CTTAATAAAGCGGATCGTTCCCCGTGGTTCTATTTAAAGTTCCATGAATGGGGTACATCCAAAATGCCAGCACATCCATT 

T AT AGAGCC GG GT TTTAAT GCTT C AAAAG CG GAAG CT GTAC GT GC TATGACAGATATTT TAAAGAAC GAAAT G AGGTTG G 

ATTT GT GATAAATTT AAGACCT GATATTTTACAAG CT CTT GAGAATGAT CAAGAG CTT GTTT C ATT GTT G G GT GG GAAAC 

GAATTTATTACCGTAAAGCAAAGAAGGCAGAAGAGTTTCCGCGAATTACGTATTTTGAATTAGACAATAGGCCAGATGGA 

TTTGCAGATAATCAAGAGATTGAAAGTGAAATCTTGTTTCAAGTTGATGTTTGGGCAAAGAGTAGTACAACAGCAATCCA 

T CAAAAAGT GAAT GAAAT CAT GAAAAGAATT GGTTT CT C ACGCTAT GC GG TT G CT GAT TTAT AT GAAGAG G ATACACAAA 

TATTTCATTATGCGATGAGATTCGCAAAAGGAGTGGAATTATAAATGGCTGGAGAAGTTGTAAGAATTAGTTCAACGGTT 

G GT GT AGACAACCTT GTAT AT GC GAAAGTTTTAC AAGAT GATT CGT CT GCT ATTAAATATACAGAT GTAAAGAAAAT G GA 

AGGTGCTGTAAAGGTT7^AATTAACTA7 i AAAAGTAGCTTCTGAGGTTATGTGGAGCGATAACAGAAAATCAGAGATTGCAG 

AATCTGATGGCGAAACTGAAGTGGAGATTGAGGTTCGAGGACTTTCACTTTCTACAAAGGCTGACATTGAAGGGTTTCCA 

GAAGTAAAAGAT G GC GTTTTAGAT GAGAAAC GT GAAG GT GAGAAAC CAT AT T TAG CTATT GGT T T CC GATT CTTAAAAGC 

TAAT GATAAGT AT C GATAT GTTT GGT T ATT AAAAG G GAAACTTT C ACAAGAG GAAGAAGAAGCT GAAAC GAAAAAAGACA 

AACCGAACTTC CAAACAACAAAATT GAAAGGTT CCTTTATT GAAC GTGATTTT GATGATAGAACGAAATTTACAGCAGAT 

GAAG AT GAAC C AACGTT C ACAAAAT T AGTTG GAG AT AATT G GTT T AATAAAGT AT AT G AAAAACCAGT G AC ACAACC ACC 

AGCAGGAAAGTAAGAGGGAGCAAAAGCTCTCTCTTTTTTATTAAATTTAGGAGGGAAAAACTATGAAATTAACATTAATG 

ATTAATAAAGAAAAACAAACTTTTAATATGCCAG7VATTTATTCCAGCCCGCCTTATTCGTCAGGCTCCTGAACTTGCTGA 

AATT C CAAACAAT C C T GGT CCAGAAGATAT G GAT AAAAT GGT T CAATT CGTAGT GAAAGT T TAT GAT GGT C AATTT ACAT 

T AGAT CAGT ATT GGG AT GGT GTT GAT G C C C GTAAATT CTT AT C GAC AACTT C AGAT GTAATTAAC GCAATTATAAAT GAA 

AC AGT G GAAG C AG C AG GG GGT AGT ACT GAAT CAG G AG AAG AAGAAAACCCAAACGCAT AG AGG GAGG AGG GCT AAC GTT C 

AG T GAGTTTAT G GAC GAGCT C TAC CT CT CTTTATT G C GACAAGG GTACAAACAC CAT CACAT T GATAAT GAG AT GGATAT 

T T G G CAT TATTT GAGACTTAAT C GAAAAAT G CAT GAAAAC G GAAAT GAAAATTAC GAAG G CT CCAATT CAAAT GAAAT AG 

AAGT G C CAG C GGAAAACATTATT TAAC GAGG GAG GT GAGAC TAT G GCGAAT GAAAT AAATAAT C TAGT C GT T AGACTT T C 

C CTT GAT AAC GTAAATTT CAGACAAG GTAT CT C GAATT CAG GT C GT GCAGT CAG GAC GTTAC AGAAT G AATT GAAATCT G 

T AAGT AC AG GAAT GGG CG GTTT T GCTAACGCTAGT C AGCAAAC AC AAGCG AAAAT GAAT ACACT CAGT AG G CT C ATT GAT 

G C G C AAAAAG AGAAAGTTAAAG C GT TAC GACAAG C CTATGATCAAAATAAG G CTAAATTAG GT GAAAAT GAT GCAG CAAC 

CCAGCGATATGCTTCGCAAGTTAATAAGGCAGTTGCTGATTTAAATAGATTTGAAAATGAATTAAAGCAAGTAAACCGTC 

AAG C T GAACAAAAAG G GAT G GATAAGT TAAACAACT CTT TAAAAT C C CTAC AAGCT GAAT T T CAGT CT AT T ACAACAG GT 
ATGGGCGGTTTTTCTAATGCGACAGAACAAACAAGGGCTAAAGTAGATGTTTTATCCCGTATGGTAGATAAACAAAAAGA 
GAAGATTAGGGAACTTCAACAAGCCTATAATCGTGCTAAAACAGAAGAAGGCGAAGCGAGTCAATCAGCACAAAGATACG 

CT GAACAAAT T CAT C G G G CAACAGCT GAAC T GAAT C GAT T T GAAACT G GATT ACAG CAGT CAAAT CGT GAAT T AGAACAG 
CAAGGGAATCGCCTATTGAACTTCGGAAATCGCATGGAGACATTAGGTAATCATTTGCAAAATGCCGGAATGCAGATCGG 
CATGGTATTTGGTGGTATGACTTACGCAATAGGTCGGGGCTTAAAATCAGCAATCACTGAATCAATGAATTTTGAGCAAC 

AGAT GG C CAAT GT AAAAG CT GTTT CT GGAT CTACT G GAGC AGAAAT G AAAAAGTTAAGT GAATT G G CT GTT AATAT G G GA 
GAAAC AAC AAAAT ACTCC AGT GTTCAAGCAGGTCAAGGTATCGAGGAATTAATAAP.GGCTGGTGTTAGCTTACAAG AT AT 
TATTAACGGCGGATTGGCAGGTGCCCTTAACTTAGCGACGGCAGGGGAATTAGAGTTAGGTGAAGCAGCCGAAATTGCTT

CCACAGCTCTGAATGCATTTAAAGCAGACCATCTTTCAGTTGCGGATGCAGCCAATATTTTATCTGGTGCAGCCAATGCT 

TCCGCAACTGATGTAAGAGAGTTAAAATATGGACTTTCAGCTTCATCAGCAGTAGCAGCGGGAGCCGGAATGACGTTTAA 

GGATACAGCTACAACTTTAGCGGTATTTGCACAAAATGGTCTTAAGGGATCAGATGCAGGTACATCTTTAAAAACAATGT 
TAAT G AGGTTAAAT C CT T CAAC AAAAGAAGC AT ATAAC AAAAT GAGAGAT T TAG G ACT TAT TACTTATAAT G CAC AG G C A 

GGTTTTGATTTCTTAGTTAAAAACGGTATTCAACCAGCTTCCAGAAATGTAGGGGATATAGAAGTAGCTTTAGAACAATA 
TGT^ATGAAAACAGAAGGTGTAACGAAATGGAATGATAAATGTGATACAACGTTTCGCGAATTAGCAACAAGTTCGGCAT 
TTTTATCATCAAAATTCTATGATCAACAGGGGCATATTCAAAGTCTAGAAAATATTTCAGGTACACTTCATGAATCGATG 
AAAGATTTAACAGACCAACAACGAAGTATGGCTCTGGAAACATTATTTGGTTCCGATGCTGTACGTGGTGCGACTATCTT 
GTTTA7\AGAAGGCGCCAAAGGTGTCAATGAAATGTGGGATTCCATGTCAAAGGTTACAGCAGCTGATGTAGCAGCGACCA 
AAATTGATACTTTAAAGGGACGACTTACATTACTAGATTCAGCGTTTTCCACAATGAAAAAGACAATTGGTGATGCACTA 
GCTCCAGTAGTTAGTGTTTTTGTTGCTGGTTTACAAAAACTTGTTGATGGATTCAACTCTTTACCTGGACCAGTACAAAA 
GGCAATAGCAATTACAGGTGGTATCGTCCTTGCTTTAACAGCTGTGGCTACAGCAATAGGTGTGGTTTTAGCAGCGTTTG 
GAATGATTGCTTCAGGAATTGGTTCTTTATCTCTTGCTTTAGCATCAGTCGGTGGGATTGCTGGAATTGCGGCTGGAGCA 
GTTGGATTCTTAGGAAGCGCGCTTGCGGTTTTAACAGGGCCAATTGGTCTAGTAGCAGCGGCTCTTATCGGAACTGGTGT 
T GTT GCAT ATAAAGCAT AT C AAAAAG C G ACT GAAGAC AGTAT CG CAT CAGT AGAC C G C TTT G CT ACAAATAC AGAAG G GA 

A^^GTAAGCTCCTCAACAAAGAAGGTTCTTGGCGAGTATTTCAAGCTGTCCGATGGTATTAGACAAAAGTTAACTGAAATT 
AGATT GAACCAT GAAGTAATAACAGAAGAACAGT C GCAAAAGTT GATTGGT CAAT AT GACAAATT AGCT AAT ACAATC AT 

TGAAAAAACCAACGCAAGGCAGCAAAAAGAAATTGAAGGGCTTAAAAAGTTCTTTGCTGATTCGTATGTATTAACCGCTG 
AAGAAGAGAACAAACGAATCGAACAGTTAAATCAGCACTATGAACAAGAAAAGCTAAAAACGCAAGA 

ATTAAAGAGAT CTTACAAACAG C G G CTAGAGAAAAC AGAGAATTAACGACAT C C GAAC GTAT CT CTTT ACAAG CATT G CA 
GGAT GAAAT GGAC AG AGT T GCT GTT GAG CAT AT GT CTAAAAATC AAAT GGAGCAGAAG GTT ATT CTT GAAAATAT GCGTG 

TGCAGGCTAGTGAAATTTCAGCTAGACAGGCAGCGGAAGTTGTAGAGAATAGCGCCAAAGCAAGAGATAAAGTTATTGAA 
GAT G CGAAAAAGACC C GT GAT GAAAAAAT T G CAGAG GCGAT T CG C CAAC GT GAT GAAAATAAAAC AAT C ACT G C T GAT GA 
AGC GAAC G CAAT CAT T G C AGAG G CAAAAC G T CAAT AT GATAGTACAGTTT CT ACAG C T C G AGATAAAC ATAAAGAAAT T G 
T G AGT GAAG CAAAAG C G C AAGCT G GT GAACAT GCAAAT CAG GTAGATT G GGAAAC T G G C CAAGTAAAAT C G AAAT AT CAA 
G CT AT G AAAGAC GAT GT TATT CGAAAAAT GAAAGAAAT GT G GT C GG ACGTT AC CAAC AAAT AT GAAGAT AT GAAAAAC T C 
T G CAAG C AACAAAGTAGAG GAGATAAAAAATAC AGTTT CAAGAAAATTT GAAG AG C AGAAAAAAGCT GTT ACT GATAAGA 
TGTCAGAAATAAAAAGTAGTATTGAAGATAAGTGGAATACAGTTGAAAAGTTTTTCAGTTCTATAAATTTACGTTCCATC 
GGTAAATCAATCATAGjDAGGGCTTGGCAAGGGAATAGATGACGCTTCAGGAGGTCTGTTTAGTAAGGCTGCGGAAATTGC 
AAGT GAT ATTAAGAAGACT ATTT CT G GAG CATTAGAAATTAACAGT CC GTCTAAAGT GAT GATT C CAGT C G GTAG C GC AG 

TTCCAGAAGGTGTTGGGGTTGGTATGGATAAGGGAAAACGATTTGTTGTGGATGCAGCAAAAAATGTAGTCGGAACTGTT 
A AGAAACAAATG GGGAATAT GC C ATCT GTTTT T GAT TT T G GATT CCAAAC AAAT CAAT AT AGT AT C C C G CAAAAT AC AT T 
TAG C GATTTCAGT G GAT ATAT GC AAC C G CAAT TAT C TT ATAACAAT C CATCTAT G G C AAAAACAATATT C C CAAATAG AC 
C AGGT G GAGAACAAGAAC T GAATT TAAC C G TAAAC AT GACTAAT GTTTTAGAT G GAAAAGAG CTT GCAAAC G GAAGT TAC 
ACCTATACTACAAAACTTCAAAATCGTGAACAAAAAAGAAGAGCGGAATTTTAAGGGTGGTGAGCACGTTGGGGAAACTT 
AGTT TT ACTTTTAAT AATA.T T AG AAAAGAT T ATA/TT CAAAT G CTA.GTT GGAAGAAAACGT CCTT C AT GGGCT C C AGTAAA 
AAGAAGATTAGTAAGAGTCCCTCATCGCGCAGGGGCTCTTTTACTTAATACAGAAACGGAGGAACGTCGTATTGACGTTC 
CT CTT G TTAT TAAAG C GAAAAAAGATAT G GCAG ATTT AC AAAAGTTAAAAGAAGATT TAGC G GATT G GTT AT ATAC AGAG 
CAAC CC G CT GAACTTAT TTTT GAT GATGAGTTAGACAGGACTTATTT AGCATTAATT GAT GG TT CT GT CGATTT G GAT GA 
AAT AGT CAAT AGAGGT AGAG GT GTT ATT ACTTTT GTTT GT C CAAT G C CGTATAAATTAG G GAAAACAAATACT CAC AAAT 
T TAC GC AAG AGT G GT CT AC AGAAAC AACTT CTTAT T TTACTAATAAAG GAAGT GT AG AAG CT CCAGC GT TAATT GAAAT G 
ACGGTGAAAAAACCAAGTACCTTTTTAGATGTATGGTTTGGAGAGTATCCGAATAATCGTGATTATTTCAGAATAGGCTA 
C C CT CT GACT GTG GAAGAAAC CAC GGTAC AAGAAC GAGAAAGAG T TAT G TG GGAT GAAAT G G C CAC T C C TAT AG GAT G GA 
CAC C C G TTACT GGACAAT T C GAT GAT AT G AAAG GAAC AGGGAGT T TTAAATCG C GT G GT G GT TAT G C G CT GT ATT GTGAA 
G ATT AC GGAAAAGAT GTAG GATT CT AC G GT G CTATAG C CAAGAAAAACATTC C GG G C G GC C C ATTAC AAG ACTTT GAAAT 
GGA.GGCATGGATGACTTTAAAGTCTAAAAATATAGGTGAAATGGGTCGTGTTG7\AGTTCTTCTTCTAGATGAGGCTAGTA 
AT GT G GTAG C C C G CAT CAAT AT GAAT GAT CTAT AT G CAAC T G C C GAAATTACAAG G G CACATAT GAAAATT G GAAAT AG C 
G GAAC AC C CAATAGTT TT C GAAAATTAGTT GATACAAGT G G G TAT TATT CGAATACATTTAAC CAAT TT C GAG G G C GT TT 
GCGTATCGCTAGGCGGGGGAAGGTGTGGTCTGTATATGTGGCTAAGTTTATAGATGGTACAGAAAAAGATGGCGCTTCGC 
TT GTAGAAC GTT GGATT GAT GAAAC AGGAAAT C CAAT GAC AGAAC GTAAAATT G C AC AAGTTAT GATT G C GAT TT G CAAG 
T G G GAT AAT CACC AG CCT G T TAAT GAAAT AC AAAT T GAT G ATTT G A/VAT TTT G GAAG GTAAACAAAGT T C CAT CT AAT GC 
ACAAC CAT ATATCTTTGATACTGGA GAT AAAATTGTTATCGATACTGAGAAAAGTCTTGTCA CAAT CAAC GGGAAGAATG 
CAAT CAAT ATAAAAGAAAT CTT T AGTAATTTT C CT GT CATAATAC GT G GTGACAAT C GTAT C GAT AT TAT GCCGC C AGAT 
GTAAACGCAACAATCAGTTATAGGGAGAGATATAGATGAGAACACCAAGCGGGATTTTGCATGTTGTGGATTTTAAAACA 
GAT CAAAT C GT CG CAGCT AT C CAAC CAGAG GACT ATT G G GAT GACAAAC GG C ATT G G GAAC TTAAAAAT AAT GTT GACAT 
GTTGGATTTCACCGCATTTGATGGAACAGACCATGCAGTTACCTTACAACAACAGAATCTTGTTTTGAAAGAAGTTCGCG 
ATGGAAGAATCGTACCATATGTTATTACAGAGACTGAAAAAAATTCCGATACACGATCTATTACCACATATGCTTCAGGA

GCTTGGATTCAAATTGCGAAATCAGGGATTATAAAACCACAACGGATAGAGAGTAAGACGGTTAATGAGTTTATGGATTT
AGCACTCTTAGGTATG7VA.GTGGAAACGCGGAATTACTGAATATGCTGGATTTCATACAATGACCATCGATGAATATATTG 
ACCCACTCACTTTTTTAAAGAAGATTGCATCTTTATTTAAACTGGA7\ATTCGATATCGTGTTGAGATTAAAGGTTCAAGA 
ATCATCGGTTGGTATGTAGATATGATTCAAAAACGTGGTCATGATACAGGCAAAGAAATAGAATTAGGAAAAGATTTAGT 
CGGTGTTACGCGAATTG7UVCATACACGTAATATTTGCTCTGCTTTAGTTGGATTTGTAAAAGGTGAAGGTGACAAAGTAA 
T C AC T ATT GAAAG CATT AAT AAAG GT CTACC CT AT AT C GT AG AT G CAG AT G CGTTT CAAAGAT G GAAT GAAC AC G GAC AA 
CATAAATT C G GT TT TTATAC ACC AGAAACAG AAGAATTAGACAT GACT C C AAAAC GTTT ACT GAC G CTTAT G G AAATAGA 
ATTGAAAAAGCGTGTC7\ACTCTTCAATTTCTTATGAAGTGGAAGCACAATCTATTGGTCGTATTTTCGGTCTAGAACACG 
AATTAATTAACGAAGGCGACACGATTAAAATTAAAGATACAGGGTTTACACCAGAATTATATCTTGAAGCGCGAGTAATA 

GCT G GG GAT GAATCTTTTACAGATT CAACGCAAGATAAATATGAATT CGGAGATTATC GT GAGATAGT TAAT CAAAAT GA 
GGAATTAAGAAAAATTTATAATAGAATCCTTAGTTCGCTTGGTAATAAACAAGAAATGATAGATCAGCTAGACAGATTAG 
TTCAAGAAGCTAACGAAACCGCTAGTAATGCAAAGAAGGAGTCAGAAGCAGCAAAAACACTAGCTGAAAAAGTACAAGAA 
AATATTAAAAATAATACCGTTGAAATTATAGAATCTAAGAATCCACCGACAACAGGTCTTAAACCATTTAAAACGCTTTG 
GCGTGATATTAGTATCGGAAAGCCTGGTATTTTAAAAATATGGACAGGTACAGCGTGGGAATCGGTTGTACCTGATGTTG 

AAT CT GTAAAAAAAGAAACATTAGAT CAGGTTAATAAAGAT AT CGCAACC ACAAAAACAGAGTTAAAT CAAAAGGTT CAA 
GAAGCCCAGAACCAAGCGACTGGTCAATTCAATGAAGTGAAAGAGAGTTTACAAGGCGTTAGTCGTACGATTTCTAATGT 
TGAGAACAAACAAGGTGAAATCGATAAGAAGATTACTAAGTTTGAACAAGATTCAAGTGGATTTAAAACTTCAATTGAAT 

C GTT AAC G AAAAAAGATACT GAAATT AG T AATAAAT TAAAT ACAGTT G AGT CTACT GT G GAAG G T AC GAAAAAG AC G ATA 
TCTG AG GT ACAG CAAACAACT AAT GATT TAAAGAAAAAAACT ACT GAAAT AGAAGAGAAGGCTGG 

ACTTACAAGTTTAGAGACAAGAGAAGTTAATGTTCGAAACTATGTAATTAACTCTGATTTTTCGAATGTTACT^lAATTCTT 
GGATTGGAATTACTAATGCAACTCTTTTTAAATTTGTAGATGTGAATATTTCGGAAGCCTCCGCTATTAAGAAAGGTTTA 
CAAATAACAAGTAATAAAGCTTTTGTTTATCAGAAGTTACCCGCAGACGTGTTTAAAAAGAAGAAGGGGATAGCTTCTTG 

TTATATAAAT GTAT CAAGTTTT ACACCT GGT ACAG ATTAT C CAC GTTTATATATGAGATTCACCTAT GACCAAAAC GGAA 
CAG AAAAAC AAT AT TAT GC CAT TTTAAAACAAC AAGAAGT AACTAAT G GAT G GAT TAG GAT T T CT ATAC CAT TT GAT AC A 
ACT G GATATACAGGTGAATTAAAAGAAGTACGTGTAAATATAGCTACCGCTGACACAACTACTAT C GAT GCAAC GTT CAC 
T G GAAT AAT G GT T ACATT CGGT GACT TAAT T GAAT CTT GGAAT CT C G CT CCAGAAGAT G GAGTAAC ACAAG GT GTTTT T C 
AATCTAAAACAACCGAGATTGAAAAAAGTGTGGATGGTGTAAAAACTACTGTAACAAATGTTCAA 

TT T GAAAAG C G CAT GT CTAAT GT G GAACAAAC AG CAACT G GATT AT CTT CT ACCGT AAGTAATTT AAACAAT GTAGT AT C 
CGATCAAG GAAAAAAGCTTACTGAAGCAAATACAAAACTCGAACAGCAAGCAACCGCGATTGGAGCAAAAGTTGAGCTTA 
AACAAGT AGAGGAT TAT GTT G CT G GGTTTAAGAT T C CT G AGT T GAAACAAACAGTT GATAAAAATAAACAAGATTT AT T A 
GAT GAATTAGCCAATAAGCTT GCAACTGAACAATTTAACC AGAAGATGACT CTGATT GATAACCGTTT CACTATTAAT GA 
ACAG GGT AT CAAT G CC GCAG C AAAAAAGAC AGAAGT AT AT ACAAAGAC G CAAGC AGAT G G ACAAT T T GCTACAGATT CTT 
AT GTAAGAGATATGGAGT CGCGC CT GCAGCTAACAGAAAAGGGT GTTAGCATATCTGTAAAAGAAAAT GATGTAATCGCA 
GC CATTAACATGAGTAAAGAAAACATTAAGTTAAAT GCTGCACGAATAGATTTAGTTGGT7VAAGTTAAT GCGGAGT GGAT 

TAAAGCT G GATT G CT GAGCGGTT GC CAAAT T AGAACAT CAAAT AC G GATAACTAT GTTAGT T TAGAT GAT CAATTT ATAC 
GTCT CTAT GAAAGA.GGAGTTGCTAGAGCATTT CT GGG GCATTACAGAAGAT CAGAT GGT GCAG TAC AAC C GACT TT CAT C 
TTAGGTTCAGATGAAAAGACTAACGCTCCGGAAGGTACTTTGTTTATGTCTCAAGCAGGTGCAGGATGGTCAGGGGCTTA 
TGCGAGCATTGGTATTAGCAATGGCATAGTTGATGGTGCAGTCCAAAAGTCTGTGTATTGGGAGTTGCAAAGA^vACGGAC 
TAAGTGTTCT7W\CGCT7^TGATTACCATGTTTTTTACGCTGGAAATGGA7^ATTGGTATTTCAGAAGAGGGAAACCAGGG 

T T GT AT C AAACTT C GTTAGTCGT T GAAGATAAT AGT AC AGAT T CT GAT TTAAGAT TAC CTAAT GT AACT ATAC GTAAT AG 
CCGTGCAGCAGGATATACAGGAGTTATTCAATTG7\AATCCCCTGTTACTCAAAATGGATGGGGTGCTGTTCAAGGGAATT 

T TAT GACT C CTT CAT TACGGGAGTATAAAT CTAAT AT C CGT GAT AT T T CTTTTTC C G C CTT AGAAAAAATT AGAAGT CTT 
AAAATT AG AC AAT T TAATTATAAGAAT G CT GTAAAC GAAC TAT ACC GGAT GAGAGAAGAG AAAAGT C CCAAT GAT C CAC C 
ATT G AC AACAG AAGATATTAAAACATACT AC GGTTTAAT C GT AGAT GAAT GTGAT GAAAT GT T T GT G GAT GAAAGT G G GA 
AAG GAAT T CATTT GTACT CAT AC G CAT C CAT T G GAAT TAAAG GTTTAC AAGAAGTT GAT G CAACAGTACAG GAAC AG GAG 
GTAGAAAT AG CAAAT C TAAAAT C ACAAATAG CTAGT C AAGAAGAT C G GAT AG CAC GATT AGAAGAATT ATT ACT AC AACA 
AT TAAT AAAT AAG AAAC CAGAG C AGC C ATAG GCTGGTCTTT TTATTT T G GCCAAAAAG GAGAG GAAAAGAT G GAT C GTAT 
TGATGTATTACTAAAAGCATTTATAGCTGCGTTTGGTGGCTTCTGTGGGTATTTCTTGGGAGGATGGGATGCAACATTGA 

AAAT CTT AGT GAC AAT G GTAGTT ATT GAT TAT TT AACT G G CAT GATT G C AG CAG G GTAT AAC G GAG AATTAAAAAG CAAA 
GTTGGTTTCAAAGGCATCGCCAAAAAGGTGGTGCTTTTTCTTTTGGTCGGAGCGGCCGCTCAACTAGACTCGGCACTTGG 

AAGCAACAGTGCAATCCGTGAAGCAACAATTTTCTTCTTCATGGGTAATGAATTACTTTCACTCTTAGAAAATGCCGGGC 

GAATGGGTATTCCACTCCCACAAGCATTAACAAATGCAGTTGAGATTTTAGGTGGTAAACAAAAACAAGAAGAGAAAAAA 

G GAG ATGT T CAGT AAT G GAAAT C CAAAAAAAAT TAGTT GAT C CAAGTAAGTAT G GTACAAAGT GT C C G TAT ACAAT GAAG 
CCTAAATATAT CACTGTTCACAACACATA.TAAT GAT GCT CCAGCTGAAAATGAAGTGAGTTACATGATTAGTAACAATAA 
TGAGGTGTCGTTTCATATTGCAGTAGATGACAAGAAAGCGATTCAAGGTATTCCGTTGGAACGTAATGCATGGGCTTGCG 
G AG AC G G CAAT GGTT CG G GGAAT CGT CAAT C CATTTCTGTAGAAATCT GTT ATT CAAAAT CAG GAG GAGAT AGAT AC TAT 






WO 2005/089527 



PCT/US2005/009928 



AAAGCTGAGGATAATGCTGTTGATGTTGTACGACAACTTATGTCTATGTACAATATTCCGATTGAAAATGTTCGAACTCA 
T C AATCCTGGTCAG GTAAATATTGT C C G CATAGAATGTT AGCTGAGGGAAGGTGGGGAGCATT CATT CAGAAGGTTAAGA 
ATGGGAATGTGGCGACTACTTCACCAACAAAACAAAACATCATCCAATCAGGGGCTTTCTCACCGTATGAAACCCCTGAT 
GTTATGGGAGCATTAACGTCACTTAAAATGACAGCTGATTTTATCTTACAATCGGATGGATTAACTTATTTTATTTCCAA 
AC C G ACTT CAG AT G C AC AACT AAAAG C AAT G AAAG AAT AC C T T G AC C GTAAAG GT T GGT G GT AT G AAG TT AAAT AAAACA 
AAAGAATAGTTTTATGAACAAAAATAAGAGCCGTCCTGTTGGGCGGCTTTTTTTTATTGCTCAATTACTGTTGCACTAAT 
TTTAGGCATTCCTGTTTTATCTTTTTCGTCGTAGGCGCCATAGATTGTTACTATTGATCCTTTAGATATTTTTAATCCGT 
TTTTAAGTGTTATTTCATTTTCGTTCGTTTGCACTCCACTTTGGACAATTTGAATAGTGTACATGCCTTTGCCGTCATTT 
TCGTTTGTGCTTATGACAAATGAAGGTAATGCTGAAGACTTAAGTAATAAATCTACCGTTCCGGTAGCTTTAAGCCTTTT 
TCCTTTTTCGTATTGATCTCCATTTGCCTTAACAAAACTAACTTCTTCAGCATCTTGCTTTATCTTCTTATTTAACTCAT 
CCTGAGATGTTAAATCTTTTTTAGTTTCTGGTTGAGATTTGACGTTCGTTTTTTCACTTGATTCACTTTGTTTAGAAGAA 
TCACAAGCTGTTAGACCTAACAATAAGGTACTTCCAATGCAAATACTTATAAGTTTTTTATACATTTTCATTCTCCTCCT 
CTATCCAAATTTCTTCCATGTGCAATTTTAATTCCTTTGCAATTTTATAGGCTGTAAGAAAGGTAGGTAGCGTCGTGTTA 
TTAACGAGTGAACTCATTGTAGTTTGACTAATTCCAATAAGTTTTGAAAACTCCTTTTGACGTATTTCTCTTTCAGCAAA 
AATAACACGAAGTTTACATTTTAATCGC ACAATAT C AC CT CTTTAATTATATACAATT C GCATATG G AAAT GTGT C CT CC 
T T TAATTT AAT CAACG AAC ATT T AG AAAAGT T T AAAT G GAC AG G C AAT AT AACT C TTT CTAAGT CAT AT AC CT AT AT C AA 
GAC C AC GAG GAATACCAAGTGGAAC T AAGGAC AT CAAGAGG G GAG AG G ATT ACAT G CGTT GG CAGTAT AAT C ACT T GAAT 
ACAACT C CAT AT CT T C AT C CAT CC AAAG AAT TAT GTT CAAT GTACAAT G GAT CGAGAT C AAGAG C AG AGAC G GAAT CAAT 
TTT AAAT CACAT GAAAAAT CATGAAGTTTAT GAT C GAAAAGAATATAAAGGAT ATTT C AGTT T GT C ACAG GT ATT AG AAG 
AAGAT CTAT AT G GAGAG GAlAGAAGAT GTTTT AAACT G G GAAAT T CT AAT GGATT GTTAT GAT GT AGT T CT TACAAGAAAA 
G GTATT GCATTT C GT GAlAAAAGAAG AG GAG G AAC AAG CAT GAC TCTTGCTG GAGAAG C GATT ATT AT TT G GAC G GC AAC A 
GGGTTGTCAGTAGTTGCAATGAAGGCAGCAGAAAAAATGGGGAAAAGTGTTCCACATTGGCTTCCACGTGTCACTTTGTA 
CACAACACTTACAGGCTCGTTT CTATACCTT CTACGTTAT GTT CTCGTTTT ATTT CTAT GAAGGAATACGATGTGG7\AAC 
TTTTCATTCCTTATGTCATAAGGAGTTTAGCTTGTATGCACGTATTCCTTGAAACAGGGATATATACCCTCTATAAGAGG 
GATATAAG GAGT G AT TTTATGCTGGAGTT GT T AT CAGTACCAT T C G CAG GTT TAATTTT C G C CAT AGTT G G C GAAAGG CT 
C AAAGGAAGAGAGAGT GAT CGAAAGAAAAT ACAAGTTT T TT TT GAAGT AAGCGGAATT G C GAT AC GT AGAGAGGACAAAT 
T ACAGTAT C CAGTTT TT CTT GAAC AAAAAGAG GAT GAC C GAAGT ACAACTTATATATAT C G GTT GC CT GT AG GAAT GC CG 
AGTAAAATTATT CAGAAG GT CGAG GAT GTT GT CT CT GAAGGGCTAAGT AAAC CT GTC C GAATT GAT TAT GATAATTACAA 
GCTAAATATTCGTGTGTTTCATAGGGATATACCGAAAAAATGGTCATGGTCTAAAGGTTTGGTTGCAGAAGGAAGCTGGT 

GT GTT CCAAT GG G C CAAAGTTTAGAAAAACT TAT CTAT CAT GAT TT T GATAAAACACCAC AT AT GAC ACTAG GT G GT C T G 
ACAC G GAT G GGAAAAAC GGTATTT TTAAAAAAT GT AGT T ACTT CT CT T ACTTTAG CAC AAC CAGAAC AT AT TAAT TT ATA 
CATTATTGATTTAAAAGGGGGCTTGGAGTTTGGGCCGTATAAGAATTTAAAACAGGTAGTTTCTATTGCTGAAAAGCCCG 

CAGAAGCTTTTATGATATTAACTAATATCCT CAAGAAGATGGAAGAGAAAATGGAATATAT GAAAT GT AGAC AT TAT AC G 
AATGTTGTAGAAACAAATATCAAAGAGCGTTACTTCATAATAGTAGACG7\AGGAGCCGAACTTTGCCCAGATAAAAGTAT 
GAAAAAAGAACAGCAAAGGTTATTAGGAGCGTGTCAACAAATGCTCTCTCATATAGCGCGCATAGGTGGTGCTTTAGGTT 
T T AGATT GAT T T TTT GT AC ACAGT ACCCGACAG G G G ATACATTAC C GC G CCAAGT AAAACAAAATAGT GAT G C GAAATTA 
G G CT TTAGATT AC C GACT CAAACAGC AT CAAGT GTT GTT AT AGAT GAAG CG GGATTAGAAACGATAAAAAGC AT T C C C GG 
ACGCGCGATTTTCAAAACCGATAGACTTACAGAAATACAAGTGCCTTACATTAGTAATGAGATGATGTGGGAGCATTTAA 

AAG GATAT GAGGT G GAGAAAC AT GAG GAT GCAAAC G CAT AT G CAAAT CAACCGTCAAAT GG C GAT AC TT G C GAC GATT AG 
AAAG CTACAGTTT G CAAC GAGAAG G CATT TAAT GAGT ATT CAT GAAAT G G GT G GAATAAGAAAT G CAAAT CGAATT CT GA 
AAGATTTAT CTATTT ATACAAGTAAG GT AGT TT ACAATAAAGAG CAT GTATATTATTTAAAC CAAT C AGGACATAAGTT G 
TTTGGCGAAGGGAAAGTTGTACATCATGGTAAAGTTACACACGCTCTTTTACGTAATGAAGCTTGGTTAAATTTATATTG 

T CCT GAT GATT G GC AAGTAGAAACT GAAATT AAATAT AT AAAG G ATAATAAAAAGAAAAAAATAAT T C C AGATGT GAAAT 
TT C GTGAT GAGGAC AGAATACTT CAT G C T GT AGAAATAGAT CGT ACT CAGAAAAT GAT AGT GAAC GAT GAAAAATTAAAA 
AAATAT GAG GAGT TAAC G CAGATTTATAAAC AGAAGCAT AAC G GGAAAGTG C CAGTT ATT C ATT T C TTT AC AAT CAC AAA 
AT AT AGAGAAAAGAAATTAGAAGAACT GG CAAAT AAATAT AAT GTGTTT GTAAAAGTATATGTAAT C GCTACTACTTAAT 
GATGAAAAAAAGAGCTGATCATTTTCGAATGATTAGCTCTTTTTTATGTATTGTATTACGTCGTCTATTTTGTAAATTTT 
ATTAATTCCTTTTTCTGCAGCAATGGCATTTAAAGCATCAATGATAGCTTCAAGCGAATCAAAACGAACAGCATTAGCAT 
TACCATTCACTAAATCACTAATCGTGTTGTATCTTACTTGGGATTCTGTAGATAATTTATTTTTAGTGATCCCCAATTCA
TCTAAAGAATTTCCGAGTGTGAATTTCATTTTATTCTCCTCCGCAGCACTGGTTATCTTGTACTCATTTTACAACATCAA 
TCGAAATTAGTAAAACTTTTTTCGTTCAACTATTGACGTTGAATAATTAGAGAGTTATAATTCAACTTAAAAGGAGGAAC 
AATTATGAATCGAGTAAATGATTATTTTGGTTTAGAAAGTAAATCAGATTGCATTTGGTTTTATGGTTTCTTCAGTATAT 
C T AC GATTTTAT TT T TAATCGATAT GAT TAT T G CT CTT AT ATAAGGAGG GGAGAAAAT G CTTAG CT CAG CAAAC TAT AC G 
CAATATAAAAAATTACAATCATTCC GAT CAGTAGAAGAGAT GAAT GAAGCGATTTGTT CTTTTTTATACAAACATACACA 
TGAATTATCCGAATCAGCAATAAAAGTATTGAAATTTCTAGCAAGGCACTCTTGTAAAATCCCAGGTGTCTCTTTCTTGA 

AG GTAG G G ACAATT G C G G AGG CATT AAATATAlAGT GAT C GAACT GTT C G C AGGGTACTAAAAGTAT T AGAG GAT TTT GAA 
GT AGTAACTAGAC ATAAAACAAT T C GAAC GGAAG G AAAATTAC G T GGAG GGAACG GAC AT AAC GT CTAT GT C CTT CTAAA 
AACAAAACAAGATACAAGAAATGAATTTAGAGGAATGATGTTTTGTAAACATTGTGGTGAGCCAATTACAGCTAAGTATT
C AG G TAG ATAC GCAAAAG GAAGTAAAAAAG AG TG G G TAT AT AT GAAAT GCAGTAAT TATATT AGATT C AAT C G CT G C GT T 




AGAAGCTTTTT CAAT G CT C GAT GAAGAAAAAGAT AT G CAT GAGGTTTTTAAAACTT TAATAAAGAAAAT CAC ACTTAGT A 
AG GAT AAGTATAT CG AC AT C GAATAT AC ATTT T CT T T ATAGTT TTAAAGT T G GTT AT TAGT T ACTGT GAT ACTAC C T G G A 

GTAACACCGATAGCTTGTCCAAGATCATGTTGTGTTAAATTCCGTTCTCTTCGTAATTGACGTAACCGATCTTTAAATTC 
CACAATAATCACCTC ATAAGT GGTTT GTTAGGATTATTATAATATTTC CTAAAGGGAAAATCAAT CCGAGTTATTTCTAA 
G AATAAT ATAAAATAT GT GTAAAAAT ATAT CT T GAATT T T C C CTAAG GGAAT GT TAAG GT GATTT ACAAAG AT AT AGAAA 
GGAGTTAC CACATGAAAGTAATTAAAGACGAGACAAAATTAAAAGCTGCATTCAAAAAAT CTGGGTATAAGTAT CAAGAG 
TTAGCT GACGAATTAGAAATATCCTGCAGCTACT GTTACAAG CTAATTAACAATCATAATTACAAAAAGAAAATATC GTA 
T AAC TTAGCATC CAGAAT GG C G CAT GTATTAAAT G CAAGT GTAGTT GATT T GT T T GAAGAGCAAGT C GATTT TTTT T AAT 
AC CAAT ATT C C CTGAG GGAAC ATAG G G GT GAG AGG GCCAT GT C AGAAATT TATT ACAAAGG GTT TAT CAT CAAG GAAACT 
TAT GGC GAAAGAAAT AT C GAAGAAGT GTTTAAAGAAG CAT AT GAGT CATT T TAT G G G GT T GAAGTTAAGGT T GTTAAAAA 
G GAATTAG GGACTAAAC G CAATAGT GCAG C C AGC TAAT CTTTAAACT TCAGT GAGAACATT CAAT GAAGT C GATTATAAA 
AT GGAC AAG C CT GAAAG GAGAGAAAT GAAT G AAAAACG G GAAAAG GTT GACTAAAC GT GAAAAAAT G CAT CT TAAAT CAT 
AT AG CTTAAAT CCT GATAATT G GTT G GTT TT CAAGAAAG CG GAT G GAGAAAT GCAT TT AGTAC ACC GT TATACTAGCAC A 
ACT C GT GTAAT T C CAAGTTTATAAGTT T AG GAGG GAATAAGAT GGAT CAGT TAACAGTAG CAAGT GAAT TAC GT CT TT TA 
G G GAGAAGAAAAGTAG CT G GATAT GAAT T TACT GGAAT C GAGG GAG GATTT G GT GAAGGTAAAAAAG CAAT GT T G GTT T T 
GGAT ATAGCTACAATTCAT AAC CAAC CAT TAAAAGAAAT CAAT C GT CG CATT AAT GAT AAT C GCAT T C GATTT AAAGATG 

GTGTGGATATT GTT GATTT GAAAAGTGGTGGCTTTAACCCACCACAATTATTAAACCTTGGTTTCTCAAATATGCAGATA 
GC GAAAT CAAATAAC ATCTAC CT T CT AT CAGAACGAG G TTAC GCAAAACTATTAAAAATT CT C GAAGAT GAT AAAG C T T G 
G GAATTATAC GACATATTAGTT GAT GAGT ACTT CAAC AT GAGAGAAAAGAAT CAAGT G GCTACAG AT CC AAT GAGT ATTT 
TAAAAC TTAC ATT C GAAG CAT TAGAAG G C CAG CAGC AAGCAAT CGAAGAGATAAAGT C G GAT GTACAAGACTT GAGAGAA 
AATACAC CATTATTT G CAAT T GAAT GT G AT GAAAT CT C T ACAG CT GTAAAAC GT CAAG GAGT CAT ATT GTTAG GT G GAAA 
AC AGT CTAAT GC CT AT C GAAAT C GT G GAT TAAG AGG GAAAGT T TAT C GT GATAT CT ACAAC CAACTATACC GT GAAT T C G 
GAGT GAAAAGT C ACAAAG CAAT TAAAC GT T GT CACT TAAAT GTAG C AGTAAAAATAGTT GAAGAAT ATACACTT C CAATT 
GT ATT GAG CGAAGAGATTTCTTTTGTAAATGCACAAAT GGAT TTT AC AGAAAT GTAGTT AGT T7\A7^AC AT TCT CAAC CGG 

TT TTT TTCTAAGTTAAAAATTT AAAG AAAAGGTG GAAAAG ACAATGGACCAGTTACGTGTTATT GAG GGAGAAAAAGTGG 
ATAAGCCAGATTATGTT GAGATATACCTT GGAGCATTTAT GAATGCAGTT AAT GAGTTAAAGAAACAGGAT GAGGAAACG 

AGATCATTAAGCAAGGATACGTATAAAAAAGCAATTTTTTATGGAGTTAGATACATTTCAATATCAAAAAATGACAGTTT 
GAATTAT GAC TAC CTAAT GAAT AG AT TT CTTT TAATAAGC T AT TTAGAAAAT TT GAT GAAG G T GTT GAC GC CTAG G GATT 
T TAT GAC CATATT C C CAAT C GAT AAAAATTAT G ATG GCGCTCGT TAT GAAAT GAAAGAT TACTTTTT TAC CAT GAAT GAA 
AT TAAAAAAAT C G GAAT G G ATAC AC C TATT GGAGAGAAAAT CAT GG AGTTTTTAT GGG ATTAC CAAAACT TTAAAGATAT 

AACACTATTTAACTTAGCCTCTGTAAGCATTTTAAATAAATTGCAGAAAATGCAAGGTAAAAAAACGTTAACTGAAGAGT 
TT G C C GAG C GATTAG GT ATC GATACT T AC AC GAAG C ATAAAGAAAAG G GT GGAAAAGAATAT ATTACAAAT GAC C GTAC T 
G G T GAG AT C CAAGAAG TTAAAAAAT CT AGAC C AAG ATATT TAAAAC C AG TT C AAT GATT GAT GTTATTAAG GC TTATAAA 
CAAAGAAAGTAACTTGCGCCAACAAGTTACTAAATAAAAATACTTATAAAAATATACTTATTAGAAATATAACATACACA
CTCGATGTATGGAAAGGGTGTTATTATGGCTCTTTTTAGAAAAGTGCATACAGAATTTTGGACAGACGTAAAAGTATCAG 
AAGATATGACGCCAGAAGACAAATTGTTTATGGTGTACCTTTTAACTAATCCCCATACAACTCAATTGGGAGTATATGAA 
ATCACACCTAAGATGATAGCTTTTGAAATCGGACTATCAATAGAGTCGGCTAGAGCACTATTGGAACGTTTTGAAAACCA 
TCAT AAATTAAT T AAAT AT AACAAACT GACAAGAG AAAT T G CTATAAAAAAT T G G G G C AAATAC AAC CT G AAT AGAG G CG 
GGAAACCAATTGAAGATTGTCTTAAAAGAGAAATTGATAAAGTGAAAGATTTATCTCTAATAAAATTCATTTTAGAACAT 
ACAGATCATGCAGCTTTAAAAAGAAAAATCAATCTTTATGCGGGTTTTGACGATACGTCCCACGATACGTTAGCGATACG 
T G AC C AAG AAG AAG AAAAAGAAC AAAAAAAA G AAC AAAAAG AAG AAC AAG AA G AAAAAG AAAAAG AAAAAG AAAAAC AAA 
AAGAAGAAGAAAAAGAACCAGAAGAAGAAAAAACAAGAAT AAAATCCAAAGCGT CTTT AAAAT CAGAC GCAAAGT CCAAT 
C C AAT AC C GT AT AAAG AT AT ATT G G ATT AC T T GAAT G AAAAAG C AAAT AAAAAT TT C AAT C C T AAAGC AG AAG G AC AT AG 
AA^GTTAATTCGCGCTAGATGGAATGAGGGGTATAAACTAGAGGACTTTAAAAAAGTTATCGATAACAAAACTACGCAAT 
GGTT T G GT AAGAAAAG T TTT GAT GGAAAAC CACT AGAT CAAT T TTTAAGAC C GAG C AC GT TAT T T G C ACAAAAAC AT T TT 
GAC AAC T AC TTAAAT GAAAC G GT CAAC ATAT C CAAT CAAC AAC AT GGAGAT CA G ATT G T TAT AC C TG G ATT T AG G G G GGA 
AATGCCGTTTTAGAAAGGAGTACTAAATGTGAAAAAGATACAAGATTCTTTTGAAAAACTTACTAAGTTAAAATTTGCAG 
AT GAAC AAT GT GAT AAG C AC ACCTTTAATAAAC AT G G G AAAGAAGTTAT TAAAT TAGT T AG GAAAAT GAT T GAT GAT G CA 
GGAACGGTATATTGTCCCCGCTGCATGGTTGAAGAGCAAAATTCAGTTTTATTTCAACAAGCAAATAAT CATTATAAAAA 
GATTAAT AG AGAACGGAAGAAAAATGTACT CTTT CAACACAG CAT C ATAGAAAAT CAAT CC ATT AC AG AAT C AAG ATT GT 
CTACATACAAGACGGATT GT CAAGAAACGAAAGAAAACAAAGAAAAAGCTATAAAAATT CTT GAACGCAT AAAAAAC GGT 
GAGTT TTT AAAT GTATAC ATT GCAGGG ATT CAAG G AGTAG GAAAAAGCCATTT AGC GTAT G C GAT G C T G TAT GAAT TAGT 
TAAAC AC TAT T G G GTAAT AT CAGAC GGT GAGAAATT AAAT GAC GAAC AT G CTT T T AAAAAT AT GAAAAGC T G CTT AT TT G 
T AGAGAT T GAAAAGCT AATT CGATTAAT ACAG CACT CTT TT AGAAAT ATAGAGT CAAAAT AT ACAAT GGATT AT T GTAT C 
AGTTTAATGGTAGATGTGGATTTCCTTGTAATCGATGATTTAGGAGCTGAAAGTGGTTCGATGAATCGAAACGGAGAAGC 
AAGCGATT TT GT T CATAAAAT ACTT TAT GGT GTTACAAAT G GAC G G CAAG GAG C AAAT AAAAC AAC AATT ACAACTT CAA 
ATCT GT CAAGCGCTCAATTATTT CAAAAATAC GAT C CGAAACTAGCAAGTAGATT GTTAAACG GTGTAT CGAAAGAT GAA 
ACAATT GT TTT TAAAACAAC CAC T GACAAAC GAAT T GTAAAT TT AG AC ATT G GAT T CTAATAAAAG GG GT GC G GAGAAAT 
GAAAGAG GTAAAGGG GAAAAACAC CAAAT T AAT GGAAGAAT TT G AC GT GTT ATTAAGAC AACT G CT GATT AAAT CTAAAA 
C AGAT GAAAG G GTAAAAAACTTT TT GG AT GAT CT GT T T GAAAT GCT AAGT GAT AAT AAG C T GC AGT CT GAT ATT GATT T C 
AAAAC AG C ATTAAATAAGT TAAGAGAAAAG C ACTT T C CTAAGTTT GAT AAAG GAGAG AG CAAAAAT G ACT AAAG AAAAGG 
GACAAGCT AAG GAAGTAGTTAAT GTT C GT G GAAT GT CAG AT GAT GAGT TT ATAGAGAAATACG GAAGGCT T GTAC AT CAT 
T G C G TAT G GAAAAGAT AT G C GAAAAAAAAG G C C AGTAT AG AG CGT GATAC C GGT TT AGATATT GAG G ATT TAACACAAT T 
CGGAATGATCGGTTTGATAAAGGCGCGAGATAATTTTGACCTTGAATTTGGATGTGCGTTTTCAACGTATGCTGTTCCGA 
AAAT T ATT GG GGAAATAG GAAGGG CAATT C G G GATAACCAAAA/^ATAAAAGTTC AAAGAAC CGT AT AT G G C GTAAAAG GA 
AAGATTTTAAATCAACAGTTAGCAGATAAAGAACCAGAAGAAATAGCAGACATTT T GGAT GAGT CAGTATCTTTAGTAAA 
GAC G G CT TT AGAGTAT CAAC CAAG C ACAGATT CACT CAATAAGGTTGT ATAT GC AT CT GGAG CTAAT GAAGAACT GAC AT 
TAGAAAGAATGATAGAGGAT ACTAAAACGGAAGACATTGAAGAAACAACCATTAAT CGAG CTGT GATAAGAGAATTTAAA 
GCT G CAT T G C CT CCT AAAG AAT ATAT C GTT T T AGAT AT G C GTTT ACAAAAT AT GAC G C AAC AAAACAT T GCAAAT CAAAT 
GGGATACAGTCAGGTACAAATTAGCCGTATATTAGCAAAGATTAATCAAAGAGCTGCTCAATTTGGTAAAGAAGGAGGGC 
TT CAAG ATT GAGT GTT ACAAAAG GT GTTT G TAT C GAT G TAG AT CACT CAG AT TT G CT AC AT GAGAAAGT AG AGTACT TTT 
TAT T C C C T G CT AAAC CAAGT C ATTACTAT GT AAG C AGAT T T AAT C GTAAAG GAGC G CAT TTT G GTT GTTAT CAAGCT GAA 
AG GTTT CAAAT CAC G GAAAAG GAAGTATG GAC AC C AGAAC CT CAACC GAATCT G CCT GAGT T GAAT AC AAG CTTATT CTA 
TAGAG CT C AGT T G ATT T G G C GAAAAAAGG G GTAT AAAGAT AAAC CACTTAAAGACT AC AT CGT AC AGC CGAGAGGGAAAC 
ATTGCTACTTTTGGCATGATCGGGAGCG7\AAGAAATTTTGTGGCTGTTTTCCGCTACATTGGTTTACCGATTTTGTACCA 
GT T C AAAGT CAT C AT ATAGAAGAAAAAACT AGAGAAG AGGTTAAGT T AT TAC AAC G G C C AGAT G GAC AACT T G CATT T TT 
TTAACGAAAGAAAGTGAATGGGCGTTTTACCCAGTCATCGATTT7\AAAAAAGGAGTGTTCGTAATGGATATTAAAAAGTT 
ATTTGCAATGCAGAACATTTTGGATAAAAGAGTTTTAGAGTCAAAAAATCTTTCTAGAGGAG^AlGTATTCGAATTTAGAA 
TACTAGCGTTTTTAGATGAATTAGGCGAATGCATGAAGGAATGGCGAGTATTTAAGTTTTGGAGCGACGATCGTAAACCG 
AGAACTAGCATACCTACAGGGGAAATCATAGTACTAGATGATGGTTATGAAGTAG7VAGTTTATAAAAACCCTTTACTTGA 
GGAATATGTGGACGGACTACATTTTGCAATTGGACTTTGCATAGATTTGAAAACAGAAATTAACTTTCCTGCTTCTATGC 

GTT G C GAGAC AGT TAC AGAG CAATTT TT C G AATT GTAT CAT C TAGCAATAC GAT T AAAAGAAGAAC C GAC AG CAT T TAG G 
GCAGATGTTCTTTTATCCCATTATCTTGGTTTAGGGGAATTGTTGTGCTTTTCGTTAGAAG7VAATTGGACATGAGTACAT 
T GAGAAAAACAAAAT CAAT CAT GAA CGT C AAAGT AAT GGAT ACTAAT ACAAT TT GAAT TT T G TTAAGAAAT GAG G GT GAT 
T GAAAT AAGTT G GT G G GCAATAG C GAT CG GTT TAT AT CT ATT GATT GGAGTT G CAT TACTT ATAT G GAT AAT C GCAAC GG 
ATAGTTGGGGTTCGTTATTCTTATATCCTGTTTTTGCGGTAGTCATTGTTTTGGGATGGCTTCCATTAATGATAAGAAGC 
ATTGTACAAGAGATATCT7\AAGCGATTCATAAGTGGAAAAGAAAGCAGAAAACTGAATAGAAGTATTATTTCAGGGAGGG 
AG AAT AAAT G AT TT AT G AAGT TAC AG ATT ATT GC AGT CAGT GT GAT AG AAAAAT AGAG AAT TGC GATT GCT GTT GTAAT A 
AGT GT GAT GAGT GGTT G CAC GAT T GTAAAT GTAAAG ATAAAT AAG CAAAAAAG GG G AAT GAAAGATAT GAAAT G GAT GTA 
CAAC CTT GAT AG CAAT AAT G AGAT T T G GACAAGC GAT AAATTTG AAAT GAAAG AAG AAG CT ATT CAAGCAG CTTTAAAAG 
ATTGGACAGATAAAATGGTAGCGGATAGAGCGGCAGTCGATAATGAATTCCAAATTGGACAATTCAAACAGTATTCTCCA

TGGATCAATGCAGATGTATTGTTGGATGAATTGTATGAACGAGCAACCGATGAATGTQGAGAGGTTGCGGAATATTGGCT 

TTCAGGTGTGCCGATGGACGAAGGGGAAAAGCTTCAAGAACAAATTAAT7VAGGTAGTTACAGAATGGCTAAAAGGAATAA 

AT GAGCATCCTAGCTTTGGTTCAATTGAAAATATTGAAACGATAGATGCTAGCAAAATTGAATATAAAGAAAACTAAACA 

AAAGCGTTATTTGATAAAAAATAAGAAAGCCCTAGCTTTCTTATTATATGTAAAAAGTCATATGTTTTTTATCTTCTTTA 

TAGTACTCTAACCGGTTTTGCAAAGTGCCAGTGTGGAACTCAAACTTATGGCCATCTGGATCTGTAAAGTAAAGAGATCT 

TTGGTCTCTCTCATCTCTTTCTCGGCCAGGTAAAATATTAACATCATTTTGAATTAATACTTCTTTTAAATGGTCTAATG 

CTTCATTAGTTACAGTGA7\AGCCATATGTGTATAAGATTGCTTAATTTCATTTCTTGGTATATCTTCTTCAACATTTAAA 

GCAATCCATAATCCATTTAAATCAAAATACGCTAATTTTCTACCTTTTACTAATAATTTTGCTTGAAGTATTTTTTTGAT 

AG AAT TCAAT AGATT TTT C CAAGTTT G AT AC AGAAAAACAAATAT G GT T AAT G C C C T GT AG C ATAAAAAAC G C C C C CT AT 

AATTAAATGATTTTCAATATTTTTATATAAAGATTATAAAAGTTTATGCGCGATTTATAAAGGATTACTACAAAATAGTT 

ATTTGAATTAAAAAGAGCGCCGTTGGAGAGTGCGGTGCTCTTAGACCAAGAACTATAACAGGGATTAAGGAAAGAATATT 

GTATACCAAATTGATAGTAATGCAAGCCATCCAATTGTCAGCGCTATGTATTTTAAAATTTTCATGATTACTCCTTTTAG 

GTATAGAGTGCACCAAGCAAGAGGATGTTATTAATTTTTAAACAAAATGCTTATTTAAAAACTT^AAGAGGGCTTTTTAAA 

GCGCTCCTTAAGAAAAATAAAAAAGAATACCTCATGATACTGTATGTATGTTTTTTTAGGAATGTGAGGATTTAAAACAA 

AATCGTTATTTTATAGATCGGAGTGAAATTCAAATGATTGTTAAAGCGACAATAAAACTTGAATTAGATGATTCGCAG7VA 

AAATTGGGTTTCTTATGTTAGAGAACAAGGTGGAGAAGAAGCGGTATTTCATTATCTGGAAGAAGAAGTGCAG7VAGAAAA 

TT GAATT AGCT GATT TT GT G GAG AT GAAATAC AAAAAT AAGTAATTTAAAC CAAA^ 

TAGCGTGATTAGCTAGCTGTCCTGTTAAGAAAAGAAAACGGTGTTTAGCAAATGTTGCTGTTGTAATTGCGAATTACAAC 
C AT AGTAT GAGC AGAAGTAAAAAT GTT AT GCAAGAAAGT TAAATAAAAACT GC AT TTTATT GAAAAGG G G GAAT G GAT AT 
GTCTCTAGTAGGGAATTTAAAGGAACTCCAAGAAAAAGCCATCGATGAAAAGGTATTGGAATTTGCGGAAGAAATGGAAA 
T C GTAATAAC TAAAAGT G C C G CAAG C G GAT ATT CAG GT CAT AG AT AT AAG ATT CAT AAT G AAAAT C CAAAT C G G CAT AT G 
ATGTGTTCAAAAATATTTATAGAAAAGTTACAAGAATTACTGGACGGTGTGAAGGTTGAATTTAAGGAAGAAGAAAAGAA 
AAATATTTTAGGCGGATCTTACTACGAACATTACATCCGTTTTAAGTGGAATGACTAATTTCTTATTAAAAATTTTATTT 
TGGAGAAAGGGAGTAGAAAGAATGAAAACTTTTAATGTGACTTTTACAGAGTTGAAAATATATGAAGCAGTCATTGAAGC 
G GAGT CAG C GGAAAAGATT AT T GAT GT GATT AAAC ACTTAAAAAGAACT GAAGAT GATT TAGT AGAC AAAG GAGT CAT CA 
T AAAC GAAGT T AGT GAGAT AAAT GTTAGTAAAGAAC AAAAGTT C GAAT AAAT CAACT T CT CAGAT T GTT TATTTT GAG AC 
GGAAACAACTTTCTGAATATCATAAGACCTTATTAGCGAAAAAACTCTTATTCGAGCGTACAAGCCTGTTATACACGTTG 
C AC G GAAATTAGAAT GAATTT GT TAAGGAAG GAAGT ATAAAAAT GAG GG CT T G GAAGAAAAAACAT GT TAAAAGAGC ATT 
T TT GAAT C GT C AAAAG GAAAT T GAT AAAGAAC GGACT GCT GCAG CTT G GAGAAATATTT TT GT GAAAT CAG GAAT CATAA 
AAT AAAAAAGGAAAAG CAACT C GTT GG G GACAAGT C ACTT T T C CAGAT G G CAATGTAAAT C C ATT ATA.G CAAAACAT AT G 
TACAAG CT GT AG CAATAAACAAC GAGAT AT TTT G AC AC CTAT CGACAATTAG AAAT GT G GT T GTT GAT CTAG AAAT AT GA 
A AGTAGGTGAAT CAT CATTT GTTTAACTG GCT GAGAGATTACCAAAAGTTAGAAGAAGACATAGC C TAT CT G GAAT ACAA 
CTTAGATAAGACAAAAGCTGAATTAAGACGCTGGGT GAGT GGT GATT TGAGAGAAGTAC GTT TAACG GCAG AAT CTGAAG 
G T GCAAAAGT T GAAAAC C G CAT T GAAGC GATT GAAT AC GAATT AG CACATAAGAT GAAC GAT AT GTATAAAT TAAAAAAG 
T TAATTAGTAAGTTTAGAG GT TT AGAAAAT C AG AT ACT CAAATT AAAATAT GT GG AT GGTAT G AC GT T AGAAGAAAT AG C 
AGAGGCAGTAAATTATAGTT CTAGT CATAT CAAAAAGAAAC AT G CTGAACT C GTT AGATT AATTAAGTT CGT G GAGC GAG 
AAG GT GTCATTTAG GTT CACT C CTAAAAT GAAT C GAAAC GGTTGAAAAAAT GATT TAT ATT G ATAG C AT ACAATT TT AG C 
AGAAGGG CAACT GGT GCACGGTT GCT CTTTTT GATT TTGGAGGTTATTAGAC GAT GGATGTACAAGAGTTGTCGAGACGA 
TTAGAAAAT CT AGAAC ATAAAGT G CT T CAG GTAGAAAC GAAGG CAGAT GT GCTAAAC CGAAC AG CT AT ACAAAAAGGC GA 
TAAAATAAAAGT G GT GT AT C C G CAT T TAG G GAT AC AAG G C GAGT ATTTAGT G GAGAAAATT GAT AAT G GT GT GT T GGAAT 
T GGT AG CAGAAGAAACAATGAAAAAAATACAGGAGT GATTAGGATTGAAGAAGTTAT CTAAA CAAG AGC TAG CAGCT GTA 
ATGACAC AT T GTATTT CAAC G CTT G GT GAG C AGATT GT TAAT GAG CATATTAATC C C C AGAAGT T G G C GCAAG C AAGT G C 
ACTCCATAACGATCTCTTTGATAATACCACTCCTAAAGAACGTAGGGAAGCGACGATCAGTTTACTAGGGAAAGCGATTG 
ATGAGTTTTTAGAGAGTAAGGAGTGAGGATATGGGAAAGGGATATTTTAATAAGGCTGTATGTTTAGTGTGTGGTCATCA 
AGATAGAGT GAAT CAT C CAT C T AAAAAAGAGTAT C AAG AAGT AAC GGTTT GT C C G GAAT G CAAC GGT G CTTT T GTAGAT G 
T GT G GAAG CTAG GAAAGT ACAAAC GT AATACAC AGT CT AAT GAAGAAC CTT TATT AAC AATTAC AT TAACAGATATAGAT 
G CTAAAC C GAT AGTT C AT T AC AAAG GT GAACAGAT AGAT AGAAAGTTAC GT GT T AC GT T T G ATTG G GAAT CT CAAT C GAT 
T GAT AAAAT TAAT C G G AC AT AC AT T CAT ATT GAAC AT GT AC CAG C C GAT AAC AAAC GTTTAAAT AC C GAGAC C ATT C AG C 
ATAATCATCCTATTGCAAATTVAGGAACAAGTTTAGATGTTGTCCATATTTGTTAATAGGTAAAAGATAAGTGTTTTATCT 

G GAAGT T C AAAC GT GAAT TAAAG AAATTAAAAAAGGAATAT GAAAAG GAGAGT CAC T GAAT GAAC G G GTT TAAT AAAATT 
GT AAAC GATATGCAAAATGAACAAGTAGGAAATGCTATGCTAGATTTTG CTTT GGCCGCTAAAAT GAT GTT CGCTGCCTT 
TACACAGTTTAAAGAAGCTGGATTTAACGAAGAGCAGTCATTCGAATTAACACGTGAGATATTAATTGATTCATTAAGTA 

AGAAT CAAT AG AT CAAT GAGGT GAAAG G GAAT G CAAGT ATATT G C TCT GAGT G TGATAAAAG T TAT GA CAT G CAG C C GCA 
AGTAACACAACTCCCTAATCGTATTGAGAAGTGTTTCTTTATTTGTCCTCATTGTAATCATGAACATATAGCTGCGTACG 
T GAAT GAT AAG ATT CGT AAGT AT CAAG CAG AT ATAG CAAAGTGT CAT GAGC GGATTAATAAAAAGAAT CTT G CTAT CGAA 
GAT GAAAT GAAAC GAT TAAG GAAG AG GT TT GAC AG G AGAAAGT GAGAG GT GAAG CGAGTTT G AAAAT G CTATTAACAAAG 
CATTGGTGTTTAGATAGAAACTGCGGATTTGAAGAGACTTCTCATAAGGTACGTGATGGTTGGAAATGTCCTGATTGTAA
TGGACCAATGGCGTTTCAACAGGTGAATAAGAAAAAAGAAAGCGCCAAGTGATGGTGCTTTTTATTTTGGAGGAGGATGA 
AGGATGGAAGGACAGGAGTTAACATTGGAAAAGAAAGACAGTATTTATCTTAGACCAAGATACCCTCATAAGATTGACGC 
AAGTAAAATCAAATCCTTAAAAGATGTAATTAAGATTTTAGGATTGATGGATATTCGTTTGGACGACAAGGCGGTCA.TTG 
GT C T AG AAC ACTT G ATT G AAAAG GAGG AAG AATAAAAT G GCCAAT AACAAAT TAiATT ATT GAAGT AACT GCG GATACAAC 
T GAGG CATT AGAAGGAAT T AAAGAAGT AACT GAAGC AGCTAAT GAAT GT GCAGAT G C G CT G GACAAAT TAGAAAAGATTA 
TGGATAAGTTTACAAATC GAAGT GATACAGTGGAACTCTATTGTGAAGGTAAATTGTTATC GAAGT CTACAGTTAAT CAT 
ACAGCTGATTCAATTCAATGTCGCATAATCAAGGGAGAAGAGCTTGGAGGAAGTGAACGCTGATGAAGAAACCGCTTAGA 
CCATGCTGCGAATTTCATTGTTATAATCTCACACGTGAAAGATATTGTGAGGAACATAGATACAAAGA.GAAGGAA^.CGCA 
GCAGGATAAGAATAGATACTACGACCGATTCAAACGGGACAAAGAGAGTACGGCTTTCTATAGGTCAAAGGCATGGGAAA 
GGTTAAGAGAGCAGGCACTAAT GAGAGACAAAGGGTTGTGCCTACATTGTAAGAACAATAGAAAGATTAAAGTTGCAGAT 
ATGGTTGACCATATCATTCCTVATCAAAGTTGATCCAAGTTTAAAACTCAAATTAGAAAATTTACAATCACTTTGTAATCC 
AT GT CACAACAGAAAAAC AGCAGAAGACAAAAAGAAATAC G G GTAG GG G CG G GTC GAAAAAC AT T C AG GGCGGTCTGTCC 
GTACcgccgcccc 
FIGURE 1B: Polypeptide Sequences grouped by open reading frames (ORF) of Phage
Gamma (γ)

γ phage: Polypeptides Encoded by Polynucleotide Open Reading
Frames 

Orf1 (SEQ ID NO: 3)

MAGRNKQPLSVIQGKGRSNHITKSEKNRREKQEE^RGHTDKIEAPSYLTJ^QKKEFDTIJ^VELVRLKIFS 

NLDVDSIARYVDSKDQYIKMVRLLRKTKPSDDFKLYSQMQRSKNLLFNECRSSASDLGLTITSRLKLVIPE 
VDTSQQKQSEAQKRFGDRI 

Orf2 (SEQ ID NO: 5)

MNWIMERVFAYCEDILNGKINSCKKHRWAIERFIRDYEECQSEDSPFYFDGEIAEDFYWFAKEFKHVEGI 
I^GESVELTDFQLFLAANIFGFKKKINGARRFRKVFIQLARKNAKSQFLAIVAAFCT FLGDEKQRAYIAG 

WTRDQSSEVYEAVKTGISSSELLEGKWKEAYSTIEIFKNGSVVVPLSKEARKTGDGKNPSLGIVDEYHAH 
ETDEI YDVLS SGMVARKEPLMFI ITTAGFDLSRPCYREYEYVS DI LDP S KNVENDDYFVMI CELEKNDDI 
KDESNWIKANPIVATYEEGLEGIRSDLWALDRPEKMRAFLT^ 

DFSGATLWIGGDLSMTTDLTSVGWVGMDDEGDFIVGQHSFMPEARLKEKMAIDKVRYDLWAEQGYLTLTP 
G EMVD YT I VE S W I EN FS KDKEI QEFD YDKWNALHLAQN LENKGFVCVE I P QRI AN L S I P T KN FREKVY EK 
KVKHNGDPVLFWALNNAVVKMDDQENIMISKKI SKNRI DPA?VAVLNAFSRAMYGASVRFDVSEFANKDFL 

GKLWN 

Orf3 (SEQ ID NO: 7) 

VKIVDSVXKFFWFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKI LSESVSKLPL 

KIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKWLYGNSYANIEFDRKGKVQALWPIDASK 

VTWIDDVGLLNSKTKJ^YVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 

NFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTEI, 

TIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVD 

AILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 
EGNEGN 

Orf4 (SEQ ID NO: 9) 

MEKSAKKEMKEIRALPMTIEVREVNEDEGKRTISGSIKYNNESAEMRDWWGDTFVEEIAEGAFDESLKVRD 
WGLW S H DT S QVLGNTKS KT LRI ENDKKELRFELD I PNTTVGNDAWELI KRGDVDGVS FGMKVTKDKW S S E 
ERENGKLYKRS I LNAEL YEI S PVAFPAYPTNEVS VRSLDDFKAGEKRVADEFRKRKLQI ELELI 

Orf5 (SEQ ID NO: 11) 

MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQK^^ 

GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 
EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSWVQYAVKDRAGILPLSRSLLQDSDQNILKYVT 
KWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 
ILQSDPTQKNKKLFAGTNPWWSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 
NTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 

Orf6 (SEQ ID NO: 13) 

MLVTLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKAS 
EKIRTIVQSMILQLQYASEPQEERK 

Orf7 (SEQ ID NO: 15) 

1^NPAKLDKRLTFQVKDENAKGPDGDPIDGYKDAFTW7GSFVYLKGRKYFEAAAANSEVQGETEIRI>JRDDVS 
ADMKI KYKNVI YDIVSVI PTQDHTLLIMWKRGEMNG 
Orf8 (SEQ ID NO: 17)

MKLTLMINKEKQTFNMPEFIPARLIRQAPELAEIPNNPGPEDMDKMVQFWKVYDGQFTLDQYWDGVDARK 
FLSTTSDVINAIINETVEAAGGSTESGEEENPNA 



Orf9 (SEQ ID NO: 19) 

VINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWA 
KSSTTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGVEL 

Orf10 (SEQ ID NO: 21)

MAGE WRI S S T VG VDN LVYAKVLQDD S S AI K YTDVKKMEGAVKVKLT KKVAS EVMW S DN RK S E I AE S DGET 
EVEIEVRGLSLSTKADI EGFPEVKDGVLDEKREGEKPYLAIGFRFLKANDKYRYVWIjLKGKLSQEEEEAET 
KKDKPNFQTTKLKGSFIERDFDDRTKFTADEDEPTFTKLVGDNWFNKVYEKPVTQPPAGK 

Orf11 (SEQ ID NO: 23)

MKLTLMINKEKQTFNMPEFI PARLIRQAPELAEIPNNPGPEDMDKMVQFWKVYDGQFTLDQYWDGVDARK 
FLSTTSDVINAIINETVEAAGGSTESGEEENPNA 

Orf12 (SEQ ID NO: 25)

MDELYLSLLRQGYKHHHIDNEMDIWHYLRLNRKMHENGNENYEGSNSNEIEVPAENII 
Orf13 (SEQ ID NO: 27)

MANEINNLWRLSLDNWFRQGISNSGRAWTLQNELKSVSTGMGGFANASQQTQAKMNTLSRLIDAQKEK 
VKALRQAYDQNKAKLGENDAATQRYASQVNKAVADLNRFENELKQVNRQAEQKGMDKLNNSLKSLQAEFQ5 
ITTGMGGFSNATEQTRAKVDVLSRMVDKQKEKIRELQQAYNRAKTEEGEASQSAQRYAEQIHRATAELNRF 
ETGLQQSNRELEQQGNRLLNFGNRMETLGNHLQNAGMQIGMVFGGMTYAIGRGLKSAITESMIvIFEQQMANV 
KAVS G STGAEMKKLS ELAVNMGETTKYS SVQAGQGI EELI KAGVSIjQDI I N GGLAGAIiNLATAGEIiELGEA 
AEI ASTALNAFKADHLSVADAAN I I»S GAANASATDVRELKYGLS AS S AVAAGAGMT FKDTATTIAVFAQNG 
LKG S DAGT S LKTMLMRLN P S TKEAYN KMRDLGLI T YNAQAG FD FLVKNG I Q PAS RN VG D I EVALEQ YVMKT 
EGVTKWNDKCDTTFRELATSSAFLSSKFYDQQGHIQSLENISGTLHESMKDLTDQQRSMALETLFGSDAVR 
GATILFKEGAKGWEMWDSMSKVTAADVAATKIDTLKGRL^ 

LVDGFNSLPGPVQKAIAITGGIVIALTAVATAIGVVLAAFGMlASGIGSLSLAi 

GSALAVLTGPIGLVAAALIGTGVVAYKAYQKATEDSIASVDRFATNTEGKVSSSTKKVLGEYFKLSDGIRQ 
KLTEIRLNHEVTTEEQSQKLI GQYDKLANTI IEKTNARQQKEI EGLKKFFADSYVLTAEEENKRI EQLNQH 
YEQEKLKTQEKENKIKEILQTAARENRELTTSERI SLQALQDEMDRVAVEHMSKNQMEQKVILENMRVQAS 
EI S ARQAAEWEN S AKARDKVI EDAKKTRDEKIAEAI RQRDENKTITADEANAI IAEAKRQYDSTVS TARD 
KHKEI VS EAKAQAGEHANQVDWETGQVKSKYQAMKDDVT RKMKEMWS DVTNKYEDMKN SASNKVEEI KNTV 
SRKFEEQKKAVTDKMSEIKSSIEDKWNTVEKFFSSINLRSIGKSIIEGLGKGIDDASGGLFSKAAEIASDI 
KKTISGALEINSPSKVMI PVGSAVPEGVGVGMDKGKRFWDAAKNWGTVKKQMGNMPSVFDFGFQTNQYS 
I PQNTFSDFSGYMQPQLSYNNPSl^KTIFPNRPGGEQELNLTVNMThTVLDGKELANGSYTYTTKLQNREQK 
RRAEF 

Orf14 (Tail fiber) (SEQ ID NO: 29)

LGKLS FTFNMI RKDYI QMLVGRKRP S WAP VKRRLVRVPHRAGALLLNTETEERRI DVPLVI KAKKDMADLQ 
KLKEDLADWLYTEQPAELIFDDELDRTYLALIDGSVDLDEIWRGRGVITFVCPMPYKLGKTNTHKFTQEW 
STETTSYFTNKGSVEAPALIEMTVKKPSTFLDVWFGEYPNNRDYFRIGYPLTVEETTVQERERVMWDEMAT 
PI GWTPVTGQFDDMKGTGS FKSRGGYALYCEDYGKDVGFYGAIAKKNI PGGPLQDFEMEAWMTLKSKNI GE 
MGRVEVLLLDEASNWARINl^DLYATAEITRAHMKIGNSGTPNSFRKLVDTSGYYSNTFHQFRGRLRIAR 
RGK^JSVYVAKFIDGTEKDGASLVERWIDETGNPMTERKJ^^ 

VPSNAQPYIFDTGDKIVIDTEKSLVTINGKNAINIKEIFSNFPVIIRGDNRIDIMPPDWATISYRERYR 
Orf15 (SEQ ID NO: 31)

mrtpsgilhwdfktdqivaaiqpedwddkrhwelknnvdmldftafi3gtdhavtlqqqnlvlkevrdgr 
ivpyvitetekn sdtrsittyasgawiqiaksgi ikpqriesktvnefmdlallgmkwkrgiteyagfhtm 
tideyi dpltflkkiaslfkleiryrveikgsriigwyvdmiqkrghdtgkeielgkdlvgvtriehtrni 
csalvgfvkgegdkvitiesinkglpyivdadafqrwnehgqhkfgfytpeteeldmtpkrlltlmeielk 

KRVN SSI S YEVEAQS I GRI FGLEHELIMEGDTIKIKDTGFTPELYLEARVIAGDESFTDSTQDKYEFGDYR 
EIVNQNEELRKI YNRILSSLGNKQEMIDQLDRLVQEIANETASNAKKESEAAKTIAEKVQENIKNNTVEIIE 
SKNPPTTGLKPFKTLWRDISIGKPGILKIWTGTAWBSWPDVESVKKETLDQVNKDIATTKTELNQKVQEA 
QNQATGQFNEVKESLQGVSRTISNVENKQGEIDKKITKFEQDSSGFKTSIESLTKKDTEISNKLNTVESTV 
EGTKKTISEVQQTTWDLKKKTTEIEEKAGKITEKLTSLETREVNVRNYVINSDFSNVTNSWIGITNATLFK 
FVDVNI SEASAI KKGLQITSNKAFVYQKLPADVFKKKKGIAS CYINVSS FTPGTDYPRLYMRFTYDQNGTE 
KQY Y AI LKQQEVTN GW I RI SIP FDTT G YTG ELKEVRVN I ATADTT T I DAT FT G I MVT FGDL I ES WN LAP ED 
GVTQGVFQSKTTEIEKSVDGVKTTVTNVQNSQAGFEKRMSNVEQTATGLSSTVSNLNNWSDQGKKLTEAN 
TKLEQQATAIGAKVELKQVEDYVAGFKIPELKQTVD^KQDLIiDELAl^KLATEQFNQKMTLIDNRFTINEQ 
GINAAAKKTEVYTKTQADGQFATDSYVRDMESRLQLTEKGVS ISVKEWDVI AAINMSKENI KLNAARI DliV 
GKVNAEWIKAGLLSGCQIRTSNTDNYVSLDDQFIRLYERGVARAFLGHYRRSDGAVQPTFILGSDEKTNAP 
EGTLFMSQAGAGWSGAYASIGISNGIVDGAVQKSVYWELQRNGLSVLMANDYHVFYAGNGNWYFRRGKPGL 
YQT S LWEDN S T D S D LRL PN VT I RN S RAAG YT GVI QLKS P VTQN GW GAVQ GN FMT P S LREY KS N I RD I S F S 
ALEKI RS LK I RQFN YKNAVNELYRMREEKS PWDP PLTTEDI KT YYGLI VDECDEMFVDESGKGIHLYS YAS 
IGIKGLQEVDATVQEQEVEIANLKSQIASQEDRIARLEELLLQQLINKKPEQP 



Orf16 (SEQ ID NO: 33)

MDRIDVLLKAFIAAFGGFCGYFLGGWDATLKILVTMWIDYLTGMIAAGYNGELKSKVGFKGIAKKWLFL 
LVGAAAQLDSALGSNSAI REATI FFFMGNELLSLLENAGRMGI PLPQALTNAVEI LGGKQKQEEKKGDVQ 



Orf17 (PlyG lysin) (SEQ ID NO: 35)

MEIQKKLVDPSKYGTKCPYTMKPKYITVHNTYNDAPAENEVSYMISNNNEVSFHIAVDDKKAIQGIPLERN 

AWAC GDGN G S GN RQ S I S VEI C Y S KS GGDRY YKAEDNAVDVVRQLMSMYN I P I EN VRTHQ S WS G K YC P H RML 
AEGRWGAFIQKVKNGNVATTSPTKQNIIQSGAFSPYETPDVMGALTSLKMTADFILQSDGLTYFISKPTSD 

AQLKAMKEYLDRKGWWYEVK 



Orf18 (SEQ ID NO: 37)

MKMYKKLI S I CI GSTLLLGLTACDS SKQSES SEKTNVKSQPETKKDLTSQDELN KKI KQDAEEVS FVKAN 
GDQYEKGKRLKATGTVDLLLKS SALPSFVISTNENDGKGMYTI QIVQSGVQTNENEITLKNGLKI SKGS I 
VTI YGAYDEKDKTGMPKI SATVI EQ 



Orf19 (SEQ ID NO: 39)

VRLKCKLRVI FAEREI RQKEFSKLI GI SQTTMS SLVNNTTLPTFLTAYKI AKELKLHMEEIWI EEENENV 



Orf20 (SEQ ID NO: 41) 

MRWQYNHLNTTPYLHPSKELCSMYNGSRSRAETESILNHMKNHEVYDRKEYKGYFSLSQVLEEDLYGEEED • 
VLNWEI LMDC YDWLTRKGI AFREKEEEEQA 

Orf21 (SEQ ID NO: 43)

MTLAGEAI 1 1 WTAT G L S WAMKAAEKMG KS VP HWL P RVT L YTT LT G S FL YL LR YVLVL FL 



Orf22 (SEQ ID NO: 45) 

mwkl f ipy vi rsla cMHVFLET GI YTLYKRDI RS DFMLELLS VP FAG LI FAI VGERLKGRES DRKKI QVFF 
EVSGIAIRREDKLQYPVFLEQKEDDRSTTYIYRLPVGMPSKI IQKVEDWSEGLSKPVRIDYDNYKLNI RV 
FHRDI PKKWSWSKGLVAEGSWCVPMGQSLEKLI YHDFDKTPHMTLGGIiTRMGKTVFLKNWTSLTLAQPEH 

INLY 1 1 DLKGGLEFGPYKNLKQWSIAEKPAEAFMI LTNI LKKMEEKMEYMKCRH YTNWETNI KERYFI I 
VDEGAELCPDKSMKKEQQRLLGACQQMLSHIARIGGALGFRLIFCTQYPTGDTLPRQVKQNSDAKLGFRLP 
TQTAS SWI DEAGLETI KS I PGRAI FKTDRLTEIQVPYISNEMMWEHLKGYEVEKHEDANAYANQPSNGDT
CDD . . 

Orf23 (SEQ ID NO: 47) 

mrwrnmrmqthmqinrgmailatirklqf a trrhlMSIHEMGGIRNANRILKDLSI YTSKWYNKEHVYYL 
NQSGHKLFGEGKVVHHGKVTHALLRNEAWLNLYC I PDVKFRDEDRILHAV 

EIDRTQKMIWDEKLKKYEELTQIYKQKHNGKVPVIHFFTITKYREKKLEELANKYNVFVKVYVIATT 

Orf24 (SEQ ID NO: 49) 

MKFTLGNSIiDELGITKNKLSTESQVRYNTISDLVNGNANAVRFDSLEAIIDALNAIAAEKGINKIYKIDDV 
IQYIKKS 

Orf25 (SEQ ID NO: 51) 

MAFKASMIASSESKRTALALPFTKSLIVLYLTWDSVDNLFLVIPNSSKEFPSVT^IFILFS SAALVI LYSFY 
NINFN 

Orf26 (SEQ ID NO: 53) 

MLS SANYTQYKKLQS FRS VEEMNEAI CS FLYKHTHELS ESAI KVLKFLARHSCKI PGVS FLKVGTIAEALN 
ISDRTVRRVLKVLEDFEVVTRHKTIRTEGKLRGGNGHNVYVLLKKYSWPNVLPKMSQRQDEENLTESKVS 
DT KTDKEAKLSESHPLEELKSELNVKETS ARES KEIELEDLDET FTP ENVPSQFRD WAP FFKSADKIYKL 
YHRVLIAYKRSKIDKPIEQVINQAIQAFKETVFAEKANKIRSTFEGYFYRIVESKFVMERRKECRGLLFDW 
LNE 

Orf27 (SEQ ID NO: 55) 

LKYAVWRVSTDRDEQVSSVENQIDICRYWLEKNGYEWDPNAWFDDGISGTAWLERHAMQLILEKARRlvIE 
LDTWFKSIHRI^DLRDALEIKEIilGHGIRLVTIEENYDSLYEGGNDIKFEMFAMFAAQLPKTI SVSVS 
AAMQAKARRGEFIGKPGLGYDVIDKKLVINEKEAEIVREIFDLSYKGYGFKKIAI^ILNDKGTYTKFGQLWS 
HTTVGKILKNQTYKGNLVLNSYKTVKVDGKKKRVYTPKERLTIIEDHYPTIVSKELWNAVNSDRASKKKTK 
QDTRNEFRGMMFCKHCGEPITAKYSGRYAKGSKKEWVYMKCSNYIRFNRCVNFDPAHYDDIREAIIYGLKQ 
QEKELEIHFNPKNHQKRNDKSTEI KKQI KLLKVKKEKLI DLYVEGLI DKEMFS KRDLN FENEI KEQELALL 
KLTDQNKRNKEEKKIKEAFSMLDEEKDMHEVFKTLIKKITLSKDKYIDIEYTFSL 

Orf28 (SEQ ID NO: 57) 

VI I VEFKDRLRQLRRERNLTQHDLGQAI GVTAGS I TVTNNQL 
Orf29 (SEQ ID NO: 59) 

MKVI KDETKLKAAFKKSGYKYQELADELEISCSYCYKLINNHNYKKKI SYNIaASRMAHVLNASVVDLFEEQ 
VDFF 

Orf30 (SEQ ID NO: 61) 

MREH RGERAMS E I Y YKG FI I KET YGERN I EEVFKEA YE S F YGVEVKWKKELGT KRN S AAS 
Orf31 (SEQ ID NO: 63) 

MDQLTVASELRLLGRRKVAGYEFTGIEGGFGEGKKAMLVLDIATIHNQPLKEINRRINDNRIRFKDGVDIV 
DLKSGGFNPPQLLNLGFSNMQIAKSNNIYLLSERGYAKLLKILEDDKAV7ELYDILVDEYFNMREKNQVATD 
PMSILKLTFEALEGQQQAIEEIKSDVQDLRENTPLFAIECDEISTAVKRQGVILLGGKQSNAYRNRGLRGK 
VYRDI YNQLYREFGVKSHKAIKRCHLNVAVKIVEEYTLPIVLSEEISFVl^AQMDFTEM 

Orf32 (SEQ ID NO: 65) 

MDQLRVI EGEKVDKPDYVEIYLGAFMNAVI>3ELiKKQDEETRSLSKDTYKKAI FYGVRYI SI SKNDSLNYDYL 
MNRFLLI SYLENLMKVLTPRDFMTI FPIDKNYDGARYEMKDYFFTMNEIKKIGMDTPIGEKIMEFLVJDYQH 




FKDITLFNIiASVSILNKLQKMQGKKTLTEEFAERLGIDTYTKHKEKGGKEYITNDRTGEIQEVKKSRPRYL 
KPVQ 

Orf33 (SEQ ID NO: 67) 

MALFR1<VHTEFWTDVKVSED^1TPEDKLF^^^LLTNPHTTQLGVYEITPKMIAFEIGLSIESARALLERFEN 
HHKLI KYNKLTREIAI KNWGKYNLNRGGKPI EDCLKREI DKVKDLSLI KFI LEHTDHAALKRKINLYAGFD 
DTSHDTLAI RDQEEEKEQKKEQKEEQEEKEKEKEKQKEEEKEPEEEKTRIKSKASLKSDAKSNPIPYKDIL 

DYLNEKANKNFNPKAEGHRKLIRARWNEGYKLEDFKKVIDNKTTQWFGKKSFDGKPLDQFLRPSTLFAQKH 
FDN YLNETVN I SNQQHGDQIVI PGFRGEMPF 

Orf34 (SEQ ID NO: 69) 

VKKIQDSFEKLTKLKFADEQCDKHTFNKHGKEVIKLVRmiDDAGTVYCPRCMVEEQNSVLFQQAWNHYKK 
INRERKKNVLFQHSIIENQSITESRLSTYKTDCQETKENKEKAIKILERIKNGEFLNVYIAGIQGVGKSHL 
AYAMLYELVKHYWVISDGEKLNDEHAFKNMKSCLFVEIEKLIRLIQHSFRNIESKYTMDYCISLMVDVDFli 
VIDDLGAESGSMNRNGEASDEVHKILYG\n^NGRQGAKKTTITTSNLSSAQLFQKYDPKlASRLLNGVSKDE 
TIVFKTTTDKRIVNLDIGF 



Orf35 (SEQ ID NO: 71) 

MKEVKGKNTKLMEEFDVLLRQLLIKSKTDERVKNFLDDLFEMLSDNKLQSDIDFKTALNKLREKHFPKFDK 
GESKND 

Orf36 (SEQ ID NO: 73)

MTKEKGQAKEWNVRGMSDDEFI EKYGRLVHHCVWKRYAKKKAS I ERDTGLDI EDLTQFGMI GLI KARDNF 
DLEFGCAFSTYAVPKIIGEIGRAIRDNQKIKVQRTVYGVKGKILNQQLADKEPEEIADILDESVSLVKTAL 
EYQPSTDSLN KXATYASGANEELTLERMIEDTKTEDI EETTINRAVI REFKAALP PKEYI VLDMRLQNMTQQ 
NIANQMGYSQVQISRILAKINQRAAQFGKEGGLQD 

Orf37 (SEQ ID NO: 75) 

LSVTKGVCIDVDHSDLLHEKVEYFLFPAKPSHYYVSRFNRKGAHFGCYQAERFQITEKEVWTPEPQPNLPE 
LNTSLFYRAQLIWRKKGYKDKPLKDYIVQPRGKHCYFWHDRERKKFCGCFPLHWFTDFVPVQSHHIEEKTR 
EEVKLLQRPDGQLAFF 

Orf38 (SEQ ID NO: 77)

MDIKKLFAMQNILDKRVLESKNLSRGEVFEFRILAFLDELGECMKEWRVFKFV7SDDRKPRTSIPTGEIIVL 
DDGYEVEVYKNPLLEEYVDGLH FAI GIjCI DLKTEIN FPASMRCETVTEQFFELYHIiAI RLKEEPTAFRADV 
LLSHYLGLGELLCFSLEEIGHEYIEKNKINHERQSNGY 

Orf39 (SEQ ID NO: 79) 

MRVIEISWWAIAIGLYLLIGVALLIWIIATDSWGSLFLYPVFAWIVLGWLPLMIRSIVQEISKAIHKWK 
RKQKTE 

Orf40 (SEQ ID NO: 81) 

MSGCTIVNVKINKQKRGMKDMKWMYNLDSNNEIWTSDKFEMK 

GQFKQYSPWINADVLLDELYERATDECGEVAEYWLSGVPMDEGEKLQEQINKVVTEWLKGINEHPSFGSI 
ENIETIDASKIEYKEN 

Orf41 (Fosfomycin resistance gene) (SEQ ID NO: 83)

MYQTWKWLLNSIKKILQAKLLVKGRKLAYFDLNGLWIALNVEEDIPRNEIKQSYTHMAFTVTNEALDHLK 
EVLIQNDVNILPGRERDERDQRSLYFTDPDGHKFEFHTGTLQNRLEYYKEDKKHMTFYI 
Orf42 (SEQ ID NO: 85)

MIVKATIKLELDDSQKNWVSYVREQGGEEAVFHYLEEEVQKKIELADFVfeMKYKNK 
Orf43 (SEQ ID NO: 87)

MDMSLVGNLKELQEKAIDEKVLEFAEEMEIVITKSAASGYSGHRYKIHNENPNRHMMCSKIFIEKLOEI LD 
GVKVEFKEEEKKNI LGGS YYEHYI RFKWND 



Orf44 (SEQ ID NO: 89)

MTN FLLKI LFWRKGVERMKTFNVTFTELKI YEAVI EAESAEKI I DVI KHLKRTEDDLVDKGVI I N EVS E IN 
VSKEQKFE 



Orf45 (SEQ ID NO: 91) 

VN HH L FNW L RD YQ KL E EDI AY L E YN L D KT KAE LPJRWVS G D LREVRLT AE S EGA FCVEW RI EAI E YELAH KMN 
DMYKLKKLI S KFRGLENQI LKLKYVDGMTLEEI AEAWIYS S SHI K?;KHAELVRLI KFVEREGVI 



Orf46 (SEQ ID NO: 93) 

MDVQELSRRLENLEHKVLQVETKADVLNRTAIQKGDKIKVVYPHLGIQGEYLVEKIDNGVLELVAEETMKK 
IQE 



Orf47 (SEQ ID NO: 95) 

LKKLSKQEIjAAVMTHCISTLGEQIVNEHINPQKLAQASALHNDLFDNTTPKERREATISLLGKAIDEFLES 



KE 



Orf48 (SEQ ID NO: 97) 

MGKGYFNKAVCLVCGHQDRVNHPSKKEYQEVTVCPECNGAFVDVWKLGKYKRNTQSNEEPLLTITLTDIDA 
KPIVHYKGEQIDRKLRVTFDWESQSIDKINRTYIHIEHVPADNKRLNTETIQHNHPIANKEQV 

Orf49 (SEQ ID NO: 99) 

MNGFMKIVNDMQNEQVGNAMLDFALAAKMMFAAFTQFKEAGFNEEQSFELTREILIDSLSKNQ 



Orf50 (SEQ ID NO: 101) 

MQVYCSECDKSYDMQPQVTQLPNRIEKCFFICPHCNHEHIAAYVI^DKIRKYQADIAKCHERINKKNLAIED 
EMKRLRKRFDRRK 



Orf51 (SEQ ID NO: 103) 

MEGQELTLEKKDSI YLRPRYPHKIDASKIKSLKDVIKILGLMDIRLDDKAVIGLEHLIEKEEE 
Orf52 (SEQ ID NO: 105) 

LKRRKNKMANNKLI I EVTADTTE1ALEGI KEVTEAANECADALDKLEKIMDKFTNRS DTVELYCEGKLLS KS 
TVNHTADS I QCRI IKGEELGGSER 
MKKPLRPCCEFHCYNLTRERYCEEHRYKEKETQQDKNRYYDRFKRDKES^AFYRSKAWERLREQALMRDKG
LCLHCKNNRKIKVADMVDHIIPIK\^DPSL 

KLKLENLQSLCNPCHNRKTAEDKKKYG 






WO 2005/089527 PCT/US2005/009928



FIGURE 2A: Polynucleotide Sequence of Phage W (SEQ ID NO: 2) 

CTCAACTTCGCaGaAAAATCCGTTTTTGCATATTTTTTTAAGGGGGTGTAATCATGGCTGGAAGAAATAAACAACCACTC 
TCTGTTATACAGGGAAAAGGTAGATCAAATCACATTACAAAAAGTGAGAAAAACAGACGAGAAAJ^ 

GCGGGGGCATACTGATAAAATTGAAGCTCCTTCTTATTTGACTGCAGCACAAAAAAAGGAATTCGATACTTTAGCTGCTG 

AATTAGTCAGATTGAAAATTTTCAGTAACTTAGATGTTGACAGTTTAGCAAGGTACGTTGATTCTAAAGACCAATATATA 

AAAATGGTTCGTCTGCTAAGAZVATVACAAAACCTTCAGATGACTTTAAATTGTATTCTCAAATGCAAAGAAGTAAAAATCT 

TTTATTCAATGAATGCCGTTCTTCAGCTAGTGATTTAGGTTTGACCATTACATCCCGCTTAAAATTAGTTATTCCAGAAG 

TAGATACTTCACAACAAAAGCAAAGTGAAGCGCAAAAGCGTTTTGGTGATCGTATATGAACTGGATAATGGAACGGGTTT 

TTGCATATTGCGAGGACATTTTAAACGGCAAGATAAATAGTTGTAAAAAACATCGTTGGGCCATTGAACGATTTATAAGG 

GATTATGAGGAGTGTCAAAGTGAAGACAGTCCTTTTTATTTTGATGGAGAGATAGCGGAGGATTTTTACTGGTTTGCAAA 

GGAATTTAAGCACGTTGAAGGGATTTTGGCAGGTGAATCCGTAGAATT/IACTGATTTTCAATTGTTTCTAGCGGCTAATA 

TTTTCGGATTCAAAAAGAAAATAAATGGAGCAAGGCGATTTAGAAAGGTTTTTATTCAGTTAGCGCGTAAAAATGCTAAA 

TCTCAGTTTCTTGCTATTGTAGCAGCTTTTTGTACATTTCTTGGAGACGAAAAACAACGGGCTTATATTGCTGGATGGAC 

AAGAGACCAATCATCTGAAGTTTATGAAGCTGTAAAAACAGGGATTAGTTCTAGTGAATTGTTAGAAGGTAAATGGAAAG 

AGGCTTATAGTACCATTGAAATATTTAAGAATGGTTCAGTTGTCGTTCCACTTTCAAAAGAAGCTAGAAAAACTGGTGAT 
G G T AAAAAC C C GT C T CT T GGAAT T GT C GAT GAATAT CAT G CACAT GAAAC T GATGAAATTT AT GAC G TTTTAT CGT C T G G 

TATGGTGGCAAGGAAAGAGCCGTTAATGTTTATCATAACAACAGCTGGTTTCGACTTATCAAGACCTTGTTATAGAGAGT 

ATGAGTATGTCAGTGACATCTTAGACCCGTCAAAAAATGTAGAAAACGATGATTATTTCGTTATGATCTGTGAATTGGAA 
AAGAAC GAT GATAT CAAAGAT GAGT C GAATT G G ATAAAAG CAAAC C CAAT C GT AG CT ACATAT GAAGAAG GTT T G GAAG G 

TATACGTTCAGATTTGAAGGTTGCTCTTGATAGACCTGAAAAGATGAGGGCTTTTTTAACCAAAAACATGAATATTTGGG 
T CGATAAAAAG GACAA C GGATACAJT G GATAT GT CAAAAT G G CAAAAAT GC GAAGTAGATAC C TT T GATTTTT CAG GT G C G 

ACTCTTTGGATAGGTGGCGACTTATCAATGACAACAGATTTAACTAGTGTCGGTTGGGTTGGaATGGACGATGAAGGTGA 
TTTTATTGTTGGACAACATTCATTTATGCCTGAAGCACGTTTGAAAGAAAAGATGGCCATAGATAAGGTGCGTTATGATT 
TATGGGCCGAACAAGGGTATTTAACTTTAACGCCTGGTGAAATGGTTGATTATACAATTGTT GAGT CTTG GAT AGAAAAC 
TTTT CAAAAGACAAAGAAATT CAAGAGTTT GAT TAC GATAAAT GGAAT G C GT T ACAT CTAG C ACAAAAT TT AGAGAATAA 

AGGGTTCGTTTGTGTAGAAATCCCTCAAAGGATTGCTAATTTATCCATTCCGACTAAAAATTTTCGAGAAAAAGTATACG 
AAAAGAAAGTTAAACATAAT G GAGAT C CAG T C CTTT TTT G GG C G CT TAATAAT G C T GT T GTTAAAAT G GAT GAT CAG GAA 
AAC ATTAT GATTT CGAAAAAAATAAGTAAAAAT C GTATT GAT C CAG CAGC AG CG GT CT TAAAT G CATT T G CTA G G G CTAT 
GTATGGAGCAAGTGTCAGGTTTGATGTATCTGAATTTGCAAATAAAGACTTTCTAGGCAAGTTATGGAACTAGGGAGGGG 
GTGAACATGTGAAGATAGTGGATTCTGTTAAAAAGTTCTTTAATTTTGAAAAACGCCAAACGTCGCAGGTAATAGAGTTG 
AAT AAAGAC GAT GAAAAATTAT TAGAAT G G CTAG G GATTT CTC CAAGTAC TATTAG C GTTAAAGGAAAAAAT GCT TTAAA 

AGTTGCTACAGTCTTTGCTTGTATCAAAATACTATCTGAATCCGTATCAAAGTTACCGTTGAAAATTTATCAGGAAGATG 
AAT AT G GAAT C CAAC G C G GTACAAAG CATTAT C T CAAC AATTTACT GAGACTAAGG C CTAAC C C GT ATAT GT C C AGTAT G 

AACTTTTTCGGATCATTAGAAGCTCAAAAAAATTTATATGGCAATAGCTACGCTAACATAGAGTTTGATAGAAAAGGTAA 
AGT C CAAG C GT TAT GG C C GATA GAT G CTT C TAAAG T GACAGTATACATT GAT GAC GTT G GT T TATTAAAT T C C AAAACT A 
AAAT GT G GT AT GT AGTAAATAC G G GT G GACAACAAAGAGTGTT AAAG C CAGAAGAG ATAC T G CACTTT aAAAAC G GAAT A 
ACTCTT GAT GGT CTT GT C GGT GTT C CTACAAT G GAAT ATTTAAAGT CTACATTAGAAAATT CAG CTT CAGCT GATAAATT 
C AT AAATAATTTTTACAAACAAG G GTTACAGGTAAAG GGATTAGTT CAAT AT GT C G GT GATTTAAAT GAAGAT G C GAAAA 

AGGTTTTCCGAG7\AAATTTCGAATCAATGTCTAGCGGTCTTCAAAATAGCCATCGTATTGCATTAATGCCAGTAGGATAT 
CAATTT C AACCT ATTT CATTAAAT AT GT CAGAT GCT CAATTT CT C GAAAATAC C GAACTTACTATTAG GCAAAT CG CTAC 
T GC ATT C G GCATT AAAAT G CAT C AATTAAAT GATTT GAGT AAAG CG ACT T TAAAT AAT ATT GAG CAGC AG CAAC AAC AAT 
T CTAT AC C G AT ACATT ACAAG CGACTTTAACAAT GT AT GAG CAAGAAAT GAC GT AT AAG CTAT T TTTAGACAGT GAGTT G 
GAT AAG GG GT T TT AT T CAAAATT CAAT GT AGAC GCTATTTTAAGAG C GGAT AT C AAAAC GAG AT AT GAAG CTT AC AG AAC 

GGGTATTCAAGGCGGTTTCCTTAAACCTAACGAAGCTAGAAGTAAAGAAGATTTACCACCAGAAGCTGGTGGGGATCGTT 
TACTT GTT AAT GGAAAT AT GTT GC C GATT GAT AT G GCT G GAC AG G CAT ATTT GAAG G GAG GT GAT ACTAAT GGAGAAGT C 
AGCAAAGAAGGAAAT GAAGGAAATT AGAGCTTT GCCAATGACT ATTGAAGTCCGTGAAGTTAAT GAGGACGAGGGAAAAC 
GAACAATTTCGGGATCGATAAT^ATATAACAATGAAAGTGCCGAAATGCGTGACTGGTGGGGCGATACTTTCGTAGAAGAG 
ATTGCTGAGGGAGCTTTTGATGAAAGTTTAAAAGTTCGTGATGTTGTAgGTTTATGGTCTCACGACACATCTCAAGTATT 
Ag GAAAT ACT AAAAGTAAAACTTTAC GAAT C GAAAAT GACa AGAAAGAATTAC G ATT T GAAT T AGAT AT T C CT AAT AC AA 

CT GTT G GGAAT GAC GCATGGGAATTAATTAAgCGTGGAGAT GTT GAT GGAGTTT CTTT TGG GAT GAAG GTT AC AAAAGAC 
AAAT GGT CAT C GGAAGAAC GTGAAAAT G GAAAGCTTTATAAG C GTT CGATT TT AAAT GCT GAACT A.T AT GAAAT AT CAC C 

GGTTGCATTCCCTGCATATCCAACGAATGAAGTAAGTGTACGTTCATTGGATGATTTTAAAGCTGGAGAAAAGCGAGTAG 
CT GAT GAGTT TAGGAAAAGAAAACTACAAATCGAACTAGAGCTTATATAAGGCT CTTT TTTT ATT GAT AAATTTAAGG AG 
TGATTTGAATGTCAAAAGAATTACGTGAATTATTAGCTAAGTTAGAAGGGAAAA^.GGAAG7\AGTACGCTCTCTTATGGGA 
GAAGATAAAGT GGCAGAAGCAGAACAAATGATGGAAGAAGTGCGATCACTT CAGAAAAAAATT GATT TAC AAC GCT CATT 
AGAT G AAGC AG AAACG G AAG AAC GAAAT AAT G GAAG AGAAGTT G AAACACGTAAT GTAG AT G GT GAAAT G G AAT ACCGCG 
ATGTGTTTATGAAAGCATTACGCAATAAACCATTAAATGCTGAA.GAACGTGAATTTCTTGAGGATGATTTAGAACAACGT
GCC AT GT C AG GAT T AAC T G G G G AAG AT G GAG G AC TT GT C AT CC G T CAAG AT ATT CAAAC G C AAAT C AAT GAAT TAG C T C G 

TTCATTTGATGCGCTTGAGCAATATGTAACTGTTGAACCAGTGCGTACACGTTCAGGATCACGAGTATTAGAGAAAAATT 
CAGATATGATTCCGTTTGCTGAAATCACTGAAATGGGTGAAATTCCAGAAACTGATAATCCGAAATTTTCAAA.TGTACAA 
TATGCAGTGAAGGACAGAGCAGGTATTTTACCGTTATCTCGTTCATTACTTCAAGATAGTGATCAAAACATCCTAAAGTA 
TGTGACTAAATGGCTAGGTAAGAAATCTAAAGTTACACGTAATGTGTTAATCTTGGGCGTAA.TTGAAAAGTTAA.CAAAAC 
AAGCAATCAAATCTCTGGATGATATTAAAGATGTATTAAATGTTAAATTAGACCCAGCGATTTCTCCGAATGCGATTTTA 
CTTACAAACCAAGATGGATTTAATTATTTAGACAAATTAAAAGATAAAGACGGAAAATATATTTTACAGTCAGATCCAAC 
GCAAAAAAACAAAAAACTATTTGCTGGTACTAATCCAGTCGTTGTTGTTTCGAATCG^^ 

CAGCTAAAAAAGCGCCACTTATTATTGGTGATTTAAAAGAAGCTATTGTTTTATTTAAACGTGAAGATATGGAACTGGCT 
TCTACAGATGTAGGTGGTAAAGCATTCACTCGTAATACATTAGATTTACGCGCAATTCAACGTGATGATGTGCAAATGTG 
GGATAATGAAGCAGCAGTTTACGGAGAAATCGATTTAAGCGCTCCTGTTGAACAACCTCAAGGGTAAACTAAGGAGGCAT 
TTGAATGCTTGTTACCTTAGAAGAAGCTAAAGAATGGATTCGAGTGGACGGAGACGATGACCCAACTATCACTATGTTAA 
TTAAAGCGGCTGAATTATATATTTACAAAGCAACTGGCAAAACATTTACTCAAACAAATGAAGATGCTAAGTTGCTTTGT 
TTATTTCTGGTGGCTGATTGGTACGGAAATCGACTACTTGTAGGTGAAAAAGCCAGTGAAAAAATCAGaACCATTGTTCA 
GAGTATGATATTACAGCTCCAATATGCTTCAGAGCCTCAGGAGGAAAGAAAATGAATCCTGCAAAATTAGATAAACGGCT 
TACATT T CAAGTAAAAGAT GAAAAT GCAAAAGG GC C T GACG GT GAT C C GATAGAT G GATATAAAGAT G CT TT T AC C GT AT 
GGGGCTCTTTTGTTTATTTAAAGGGAAGGAAATACTTTGAGGCAGCAGCTGCTAATAGTGAGGTTCAAGGAGAAACAGAA 
AT C AG AAAT C G GGAT GAT GTAAGT G C AGAT AT GAAAAT T AAGT ACAAAAAC GTGATTT AT GATATT GTTT C C GTTATT C C 
AACTCAAGATCATACTTTATTAATCATGTGGAAACGTGGTGAAATGaATGGCTGATGGTaTAGATTTAGATTTATTAGGA 
TTTGATCGTTTAGTTACTGAATTAGACCAAATGGGGTTACGGGGAGAGAAAATTGAAGATAAAGCTCTTGCAGCTGGTGG 
TGAACCTATTCGTAAAGCCATTGCAGAACGAGCGCCAAGAAGCCCAAGCCCCAAAAAACGATCTAAAAGTGAACCGTGGC 
GTACAGGGCAACATGGTGCAGACCAGATAAAAGTAACAAAAGCTAAACTTGAAGGTGGAATAAAAACAGTAAAAATAGGT 
C T TAATAAAG C GGAT CGTTCCCCGT G GTT CT AT TTAAAGTT C CAT GAATG G GGTACAT C C AAAAT G C CAG CAC AT CCAT T 
TATAGAGC CG G GTT TT AAT GCTT CAAAAGC G GAAG CT GTAC GT G CTAT GACAGATATTTTAAAGAAC GAAAT GAG GT T GG 
AT T T GT G ATAAAT TTAAGAC C T GAT ATTTTACAAG CT CTT GAGAAT GAT CAAGAG CTT GTT T CATT GTTG GGT GG GAAAC 
GAATTTATT AC C GTAAAG CAAAGAAGG C AGAAGAGT TT CC GC GAAT T AC GTATT T T GAATTAGAC AAT AGGC CAGAT G GA 
TTTGCAGATAATCAAGAGATTGAAAGTGAAATCTTGTTTCAAGTTGATGTTTGGGCAAAGAGTAGTACAACAGCAATCCA 
T CAAAAAGTGAAT GAAAT CAT GAAAAGAAT T GGTT T CTCAC G CTAT GC GGTT G CT GAT TT ATAT GAAG AG GATACACAAA 
T AT TT CATT AT G C GAT GAGATT C GCAAAAG GAG T GGAATTATAAAT GGCT G GAGAAGT T GTAAGAAT TAGT T CAAC G GTT 
G GT GTAGACAAC CTT GT ATAT G C GAAAGTTTT ACAAGAT GAT T C GT CT GCTATT AAATATAC AGAT GTAAAGAAAAT G GA 
AGGTGCTGTAAAGGTTAAATTAACTAAAAAAGTAGCTTCTGAGGTTATGTGGAGCGATAACAGAAAATCAGAGATTGCAG 
AATCTGATGGCGAAACTGAAGTGGAGATTGAGGTTCGAGGACTTTCACTTTCTACAAAGGCTGACATTGAAGGGTTTCCA 
GAAGTAAAAGAT G G C GTTTTAGAT GAGAAAC GT GAAG GT GAGAAAC CATATTTAGCTATT G GTTT C CGATT CTTAAAAGC 
TAAT GATAAGTAT CG AT AT GT TT G GT T ATTAAAAG GGAAACT TT C ACAAGAG GAAGAAGAAG CT GAAAC GAAAAAAGAC A 
AAC C GAACTT C CAAAC AACAAAATT GAAAG GTT CCTTTATT GAAC GT GATTTT GAT GAT AGAAC GAAATTTACAG CAGAT 
GAAGAT GAAC CAACGTT CACAAAATT AG TT GGAGATAATT G GTTTAATAAAGTAT AT GAAAAAC CAGT GACAC AAC CAC C 
AGCAG GAAAGTAAGAG G GAGCAAAAGCT CT CT CTTTTTTATTAAATTTAG GAGGGAAAAACTAT GAAAT T AAC AT TAAT G 
AT T AAT AAAGAAAAA CAAAC TTTT AAT AT G C CAGAATTTATT C CAG C C C GCCTT ATT C GT CAG G CT C CT GAACT T G CT GA 
AATT C CAAACAAT CCT GGTCCAGAAGAT AT GGAT AAAAT GGTTCAATTC GTAGTGAAAGTTTAT GAT GGT CAATTT ACAT 
TAG AT CAGT ATT G GGAT G GT GT T GAT GC C C GTAAATT CTTAT C GAC AACTT CAGAT GTAAT TAAC G CAATTATAAAT GAA 
ACAGT G GAAG CAG CAG G G GGT AGT ACT GAAT CAGGAGAAGAAGAAAAC C CAAAC G CAT AGAG GGAGGAGGGCT AAC GTT C 
AGTGAGTTTAT GGAC GAGCTCTACCT CT CTTTATT GC GACAAGGGTACAAACACCAT CACATT GATAAT GAGAT GGAT AT 
T T GGCATTAT TT GAg ACTTAAT C GAAAAAT GC AT GAAAACG GAAAT GAAAATT AC GAAGGCT CC AAT T C AAAT GAAAT AG 
AAGT G C CAG C G GAAAACATTATT TAAC GAGG GAGGT GAG ACT AT G GC GAAT GAAAT AAAT AAT CT AGT CGTT AGACTTT C 
C CTT GATAACGTAAATTTCAGACAAGGTAT CT CGAATT CAGGTCGT GCAGT CAG GAC GT TACAGAAT GAAT T GAAAT CT G 
TAAGTACAGG7^ATGGGCGGTTTTGCTAACGCTAGTCAGCAAACACAAGCGAAAATGAATACACTCAGTAGGCTCATTGAT 
G C G CAAAAAGAGAAAGTT AAAG CGT T AC GAC AAG C CTAT GAT C AAAAT AAGGCT AAAT TAG GT GAAAAT G AT GCAGCAA.C 
C CAG C GAT AT GCTT CG C AAGTTAATAAG G C AGTT GCT GATTTAAAT AGATT T GAAAAT GAATT AAAG C AAGT AAACC GT C 
AAGCTGAACAAAAAGGGATGGATAAGTTAAACAACTCTTTAAAATCCCTACAAGCTGAATTTCAGTCTATTACAACAGGT 
A.TGGGCGGTTTTTCTAATGCGACAGAACAAACAAGGGCTAAAGTAGATGTTTTATCCCGTATGGTAGATAAACAAAAAGA 
GAAGAT TAGG GAACTT CAACAAG C CTAT AAT CGT GCT AAAACAGAAG AAG GCG AAG CGA^GT CAAT CAG CAC AAAG AT AC G 
CT GAlACAAATT CAT C G G G CAACAGCT GAACT GAAT CGATTT GAAACT GG ATT AC AG CAGT CAAAT CGT GAATT AGAAC AG 
C AAGGGAAT C G C CTAT T GAACTT CGGAAlAT CGC AT GGAGAC ATT AGGT AAT CATTT G CAAAAT GC C G GAAT G CAGAT C G G 
CAT GGT AT TTGGTGGTATGACTTACGCAATAGGTCGGGGCTTAAAA.TCA.GCAA.TCACT GAAT CAAT GAATTTTGAGCAAC 

AGAT GG C CAAT GTAAAAG CT GTTT CT G GAT CT ACT G GAG CA.G AAAT GAAAAAGT T AAGT GAATT G G C T GTTAAT AT G G G A 
GAAACAlACAAAAT ACT C CAGT GTT CAAG CAG GT C AAGGT AT CGA.G GAAT TAATAAAG G CT G GT GT T AG C TT AC AAG ATAT 
TATTAACGGCGGATTGGCAGGTGCCCTTAACTTAGCGACGGCAGGGGAATTAGAGTTAGGTGAAGCAGCCGAAATTGCTT 
CCACAGCTCTGAATGCATTTAAAGCAGACCATCTTTCAGTTGCGGATGCAGCC^TATTTTATCTGGTGCAGCCAATGCT 
TCCGCAACTGATGTAAGAGAGTTAAAATATGGACTTTCAGCTTCATCAGCAGTAGCAGCGGGAGCCGGAATGACGTTTAA 
GGATACAGCTACAACTTTAGCGGTATTTGCACAAAATGGTCTTAAGGGATCAGATGCAGGTACATCTTTAAAAACAATGT 

TAAT GAGGTT AAAT CCTT CAAC AAAAG AAGCAT AT AACAAAAT GAGAGATT T AG G ACT T ATT ACTT ATAAT G CACAG G C A 
GGTTTTGATTTCTTAGTTAAAAACGGTATTCAACCAGCTTCCAGAAATGTAGGGGATATAGAAGTAGCTTTAGAACAATA 

TGTAATGAAAACAGAAGGTGTAACGAAATGGAATGATAAATGTGATAC^^ 

TTTT AT CAT CAAAATT CT AT GATCAACAG GGGCATA/TTCAAAGTCT AGAAAAT ATTTCAGGT AC ACTTC AT GAATCGAT G 
AAAGATTTAACAGACCAACAACGA^*GTATGGCTCTGGAAACATTATTTGGTTCCGATGCTGTACGTGGTGCGACTATCTT 
GTTTAAAGAAGGCGCCAAAGGTGTCAATGAAATGTGGGATTCCATGTCAAAGGTTACAGCAGCTGATGTAGGAGCGACCA 
AAATTGATACTTTAAAGGGACGACTTACATTACTAGA.TTCAGCGTTTTCCACAATGAAAAAGACAATTGGTGATGCACTA 
GCTCCAGTAGTTAGTGTTTTTGTTGCTGGTTTACAAAAACTTGTTGATGGATTCAACTCTTTACCTGGACCAGTACAAAA 
GGCAATAGCAATTACAGGTGGTATCGTCCTTGCTTTAACAGCTGTGGCTACAGCAATAGGTGTGGTTTTAGCAGCGTTTG 
GAATGATTGCTTCAGGAATTGGTTCTTTATCTCTTGCTTTAGCATCAGTCGGTGGGATTGCTGGAATTGCGGCTGGAGCA 
GTTGGATTCTTAGGAAGCGCGCTTGCGGTTTTAACAGGGCCAATTGGTCTAGTAGCAGCGGCTCTTATCGGAACTGGTGT 

T G^T G C AT AT AAAG C AT AT C AAAAAG C G AC T GAAG AC AGT AT CG C AT CAGT AGACCG CTTT GC T ACAAATAC AGAAG G GA 
AAGTAAGCTCCTCAACAAAGAAGGTTCTTGGCGAGTATTTCAAGCTGTCCGATGGTATTAGACAAAAGTTAACTGAAATT 

AGATT GAAC CAT G AAG TAAT AAC AGAAG AAC AGT C G CAAAAGTT G ATT G GT C AAT ATGACAAATT AGCTAAT ACAAT CAT 
TGAAAAAACC AAC GCAAGGCAGC AAAAAGAAATTGAAGGGCTTAAAAAGTTCTTT GCT GATT C GTAT GTATTAAC CGCT G 
AAG AAGAGAACAAAC GAAT C GAACAGTTAAAT CAGCACTAT G AAC AAGAAAAGC TAAAAAC GC AAG AAAAAG AAAAT AAA 
ATTAAAGAGAT CTTACAAACAGCGGCTAGAGAAAAC AGAGAATTAACGACAT CCGAAC GTATCTCTTTACAAGCATTGCA 
GGATGAAATGGACAGAGTTGCTGTTGAGCATATGTCTAAAAATCAAATGGAGCAGAAGGTTATTCTTGAAAATATGCGTG 
TGCAGGCTAGTGAAATTTCAGCTAGACAGGCAGCGGAAGTTGTAGAGAATAGCGCCAAAGCAAGAGATAAAGTTATTGAA 

GAT GC GAAAAAGAC C C GT GAT G AAAAAATT G CAGAG G C GAT T C G C CAAC GT GAT G AAAAT AAAAC AAT CAC T G CT GAT G A 
AGC GAACGCAAT CATTGCAGAGGCAAAACGT CAATATGATAGTACAGTTTCTACAGCT CGAGATAAACATAAAGAAATTG 
T GAG T GAAG C AAAAG C G C AAG C T G GT GAACAT G C AAAT C AG GT AG AT T G G GAAA C T G G C C AA GT AAAAT C GAAAT AT C AA 
G CTAT GAAAGAC GAT GTTATT CGAAAAAT GAAAGAAAT GT G GT C G GAC GTT AC CAACAAATAT GAAGAT AT GAAAAACT C 
TGCAAGCAACAAAGTAGAGGAGATAAAAAATACAGTTT CAAGAAAATTTGAAGAGCAGAAAAAAG CT GTTACTGATAAGA 
T GT CAGAAAT AAAAAGTAGTATTGAAGATAAGT GGAATACAGTT GAAAAGT TTTT CAGTT CTAT A7\ATTTACGTT C CAT C 
GGTAAATCAATCATAGAAGGGCTTGGCAAGGGAATAGATGACGCTTCAGGAGGTCTGTTTAGTAAGGCTGCGGAAATTGC 

AAGT GATATTAAGAAGA CTAT TT CT G GAG CATTAGAAATTAAC AGT C C GT CTAAAGTGAT GATT C CAGT C GGT AGC GC AG 
TTCCAGAAGGTGTTGGGGTTGGTATGGATAAGGGAAAACGATTTGTTGTGGATGCAGCAAAAAATGTAGTCGGAACTGTT 

AAGAAACAAATGGGGAATATGCCATCTGTTTTTGATTTTGGATTCCAAACAAATCAATATAGTATCCCGCAAAATACATT 

TAGCGATTTCAGT GGATATATGCAACCGCAATTAT CTTATAACAAT CCATCTATGGCAAAAACAATATT CCCAAATAGAC 

CAG GT G GAGAACAAGAACT GAATTTAAC C GTAAACAT GACTAAT GT TTTAGAT GGAAAAGAG CTT G CAAACGGAAGTTAC 
ACCTATACTACAAAACTTCAAAATCGTGAACAAAAAAGAAGAGCGGAATTTTAAGGGTGGTGAGCACGTTGGGG7\AACTT 

AGTTTTACTTTTAATAATATTAGAAAAGATTATATTCAAATGCTAGTTGGTVAGTVAAACGTCCTTCATGGGCTCCAGTAAA 

AAGAAGATTAGTAAGAGTCCCT CAT CGCGCAGGGGCT CTTT TACT TAATACAGAAACG GAG GAAC GTCGT ATT GACGTTC 

CTCT T GTTATTAAAG CGAAAAAAGATAT GGC AG ATT TACAAAAGTT AAAAGAAGATTTAG C G GATT G GTTAT AT ACAGAG 
C AAC CT G CTGAACTT AT TTTT GAT GAT GAGTTAGACAG GAC T TAT T TAT CAT TAATT GATG GTT CT GT CGATTT G GAC GA 
AATAGT CAATAGAGGTAAAGGT GTT ATTACTTT T GTT T GT C CAAT G CC GTAT AAAT TAG G GAAAAT CAAT ACT CACAAAT 
TTAC G CAAGAGT G GT CT ACAGAAACAACTT CTT ATT TTACTAATAAAG GAAGTG TAGAAG CT CCAG CATTAATT GAAAT G 
ACAGT GAAAAAACCAAGTACCTTTTTAGATGTATG GTTT GGAGAGTATCCGCATAATCGT GATTATTTCAGAATAGGCTA 
C C CT CT GACT G T GGAAGAAAC CAC GGTACAAGAAC GAGAAAGAGT CAT GT G G GAT GAAAT G G CT ACT C CTAT AG GAT G GA 
CAC CT GTTAC T G GACAATT C GAGGAGAT GAAAGGGACAG GT AGTTT TAAAT CAAGAGGTG GT CAT GC ACTAT ATT GT GAA 
GATTACGGAAAAGAGACAGGATTCTACGGTGCTATAGCCAAGAAAAACATTCCGGGCGGCCCATTACAAGACTTCGAAAT 

G GAGG CAT G G G T GACT TTAAAGT C CAAAAACATAAGC GAAAT GGGACGT GTT GAAGTT CTT CTTTT AGAT GAGAC GAGT A 
AC GT GATAT CCCGCATCAATAT GAAT GAT CTATAT GCGACCGCTGAAATTACAAGGGCGCATAT GACAATT GGAAATAGC 
GGAACACCCAATAGTTTTCGAAAATTAGTTGATACAAGTGGATTTTATTCGACAACATTTAACCAATTCCGAGGGCGTTT 
ACGTATTGCTAGGCGGGGGAAGGTGTGGTCTGTATATGTGGCTAAATTTATAGATGGTACAGAAAAAGATGGAGCTTCAC 

TT GTAGAAC GT T G GAT T GAT GAAACAG GAAAT C C GAT GACAGAACGT AAAAT T G C ACAAGTTAT GAT T GCGAT TT GC AAG 
T G G GAT AAT CAT CAAC CT ATTAAC GAAAT GCAAATT GAT GATTTAAAAATT T GGAAGGT AAACAAAGTT C CAT CT AAT GC 
ACAACCATATAT CTTTGATACTGGAGATAAAATT GTTAT CGAT ACT GAGAAAAGT CTTGT CAC GAT CAAT GGGGAGAAAG 
CA^T CAAT AT AAAA GAAAT CTTTAGTAATTTTCCTGT C GTAATACGT GGT GAAAAT CGTAT CGAT ATTAT GCCGCCTGAT 
GTAAAC GCAAC AA T C AGTT AT AGGGAGAGATATAGAT GAGAACAC CAAGC G GGATTTT GC AT GT T GT GGATTT TAAAACA 
GAT CAAAT C GT C G CAG CT AT CCAAC CAGAG GACT ATT GG GAT GACAAACGGCATT GGGAACTTAAAAAT AAT GTT GACAT 
GTTGGATTTCACCGCATTTGATGGAACAGACCATGCAGTTACCTTACAACAACAGAATCTTGTTTTGAAAGAAGTTCGCG 
ATGGAA.GAATCGTACCATATGTTATTACAGAGACTGAAAAAAATTCCGATACACGATCTATTACCACATATGCTTCAGGA
GCTTGGATTCA7\ATTGCGAAATCAGGGATTATAA7UVCCACAACGGATAGAGAGTAAGACGGTTAATGAGTTTATGGATTT 
AGCACTCTTAGGTATGAAGTGGAAACGCGGAATTACTGAATATGCTGGATTTCATACAATGACCATCGATGAATATATTG 
ACCCACTCACTTTTTTAAAGAAGATTGCATCTTTATTTAAACTGGAAATTCGATATCGTGTTGAGATTAAAGGTTCAAGA 
AT CAT C G GTTGGTATGTAGATAT GATT CAAAAACGTG GT CAT GATACAGGCAAAGAAATAGAATTAG GAAAAGATTTAGT 
CGGTGTTACGCGAATTG7\ACATACACGTAATATTTGCTCTGCTTTAGTTGGATTTGTAAAAGGTGAAGGTGACAAAGTAA 
T C ACT AT T G AAA G CATTAATAAAG GT CTA C C CT ATAT C GTAGAT G C AGAT G C GTTT C AAAGAT GGAAT GAAC AC G GACAA 
CATAAATTCGGTTTTTATACACCAGAAACAGAAGAATTAGACATGACTCCAAAACGTTTACTGACGCTTATGGAAATAGA 
ATTGAAAAAGCGTGTCAACTCTTCAATTTCTTATGAAGTGGAAGCACAATCTATTGGTCGTATTTTCGGTCTAGAACACG 
AATT AAT T AAC GAAG G C GACAC GATTAAAATTAAAGATACAG G G TT TACAC CAGAATTAT AT CT T G AAG C G C GAGTAATA 
GCT GGG GAT GAAT C TTTTAC AGATT CAAC GCAAGATAAATAT GAAT T C G GA GATT AT C G T G AGATAGTTAAT CAAAAT GA 
GGAATTAAGAAAAATTTATAATAGAATCCTTAGTTCGCTTGGTAATAAACAAGAAATGATAGATCAGCTAGACAGATTAG 
TTCAAGAAGCTAACGAAACCGCTAGTAATGCAAAGAAGGAGTCAGAAGCAGCAAAAACACTA 

AAT ATT AAAAAT AAT AC C GTT G AAATT AT AGAAT C TAAG AAT C C AC C GACAAC AG G T C T T AAAC CAT T TAAAAC G CT T T G 

GCGTGATATTAGTATCGGAAAGCCTGGTATTTTAAAAATATGGACAGGTACAGCGTGGGAATCGGTTGTACCTGATGTTG 
AAT CT GT AAAAAAAGAAAC AT T AGAT C AG GTTAATAAAGATATC G CAAC CACAAAAACAGAGTTAAAT CAAAAG GTT CAA 
GAAGCCCAGAACCAAGC GACT G GT C AATT CAAT GAAG T GAAAGAGAGTT TACAAG GCGTT AGT C GTAC GATT T CTAAT GT 
TGAGAACAAACAAGGTGAAAT C GATAAGAAGATT ACT AAGTTT GAAC AAGATT CAAGT G GATT TAAAAC TTCAATT GAAT 
CGTT AACGAAAAAAGAT ACT GAAATT AGTAATAAATTAAAT ACAGTT GAGTCT AC T GT G GAAG GTAC GAAAAAGAC GATA 
TCTGAGGT ACAGCAAACAACTAATGATTTAAAGAAAAAAACTACT GAAATAGAAGAGAAGG CT GGAAAAATCAC CGAAAA 
ACTT AC AAGTTT AG AGACAAGAGAAGTT AAT GTT CGAAAC TAT GT AATT AAC T CTGATTTTT C GAAT GTTAC AAATT CTT 
GGATTGGAATTACTAATGCAACTCTTTTTAAATTTGTAGATGTGAATATTTCGGAAGCCTCCGCTATTAAGAAAGGTTTA 
CAAAT AAC AAGT AAT AAAGCTT TT GTTT AT CAGAAGTT AC C C GC AGAC GTGTTTAAAAAGAAGAAG G G G ATAG CTT CTT G 
TTATATAAAT GTA.T CAAGT TTTACACCTGGTACAGAT TAT CCACGTTTATATATGAGATT CACCTATGACCAAAACGGAA 
C AGAAAAACAAT ATT AT GCCATTTTAAAACAACAAGAAGTAACTAATGGATGGATTAGGATTT CTATAC CATTTGATACA 
ACT GG AT AT AC AG GT GAATTAAAAG AAGT AC GT GT AAAT AT AGCT AC C G CT GACACAAC TACTAT C GAT GCAAC GT T CAC 
T G GAAT AAT G GTT ACATT CGGT GACTT AATT GAAT CT T GGAAT CT CG CT CCAGAAGAT G GAGT AACACAAGGTGTT TTT C 
AAT CT AAAACAAC C GAGATT G AAAAAAGT GT GGAT GGT GT AAAAACTACT GTAACAAAT GTT CAAAATAG CCAAGCT GGA 
TTT GAAAAGCGCATGTCTAATGTGGAACAAACAGCAACT GGATTAT CTT CTAC CGT AAGT AAT TT AAAC AAT GTAGTAT C 
C G ATC AAG GAAAAAAGCT TACT GAAGC AAAT ACAAAACT C GAAC AGCAAGC AAC C G C GATT GGAGC AAAAGTT GAGC T T A 
AACAAGTAGAG GATTATGTTGCTGG GTTTAAGATTCCT GAGTT GAAACAAACAGTT GATAAAAATAAACAAGATTTATTA 
GAT GAAT T AG CC AAT AAGCTT G CAACTG AACAAT TTAAC C AGAAGAT GACT CT GATT GAT AAC C GTTT CACTAT TAAT GA 
ACAG GGTAT CAAT G C C GC AG CAAAAAAGAC AGAAGTAT AT ACAAAGACGCAAGCAGAT G GAC AATTT G CTAC AGATT CTT 
AT GTAAGAGATAT GGAGT CGC G C CT G C AGCT AACAGAAAAG GGT GTT AGCAT AT CT GT AAAAGAAAAT GAT GTAAT C GCA 
GCCATTAACAT G AG T AAAGAAAAC ATT AAGT TAAATGCTG CAC GAAT AGAT TT AGT T GGTAAAGTTAAT GCGGAGT GGAT 
TAAAGCT G GATTGCT GAGCGGTTGCCAAATTAGAACATCAAATACGGATAACTAT GTTAGTTTAGAT GAT CAATTTATAC 
GT CT CTAT GAAAGAG GAGT TGCTAGAG CAT T T CT GGG GC ATT ACAGAAGATCAGAT G GT GCAGT ACAACCGACTT T CAT C 
TTAGGTTCAGATGAAAAGACTAACGCTCCGGAAGGTACTTTGTTTATGTCTCAAGCAGGTGCAGGATGGTCAGGGGCTTA 
TG CGAGCATT GGTATTAGCAAT GGCATAGTT GAT GGTGC AGT CCAAAAGT CTGTGT AT T GG GAGTT GCAAAGAAAC GGAC 
TAAGTGTTCTAAACGCTAATGATTACCATGTTTTTTACGCTGGAAATGGAAATTGGTATTTCAGAAGAGGGAAACCAGGG 
T T GT AT CAAACTTC GT TAGT C GTT GAAGAT AAT AGTAC AGATT CT GATTT AAGATT ACCTAAT GTAACT AT AC GTAAT AG 
C CGT GCAG CAG GAT AT ACAG GAGTTATT CAATT GAAAT C C CCT GT TACT CAAAAT G GAT G G G GT G CT GT T CAAG GGAAT T 
TTATGACTCCTTCATTACGGGAGTATAAATCTAATATCCGTGATATTTCTTTTTCCGCCTTAGAAAAAATTAGAAGTCTT 
AAAATTAG AC AAT T TAATTATAAGAAT GCT GTAAAC GAACTATAC C GGAT GAGAGAAGAG AAAAGTCC CAAT G AT CCACC 
ATT GACAACAGAAG ATATTAAAACATACTAC GGTTTAAT C GTAGAT GAATGT GAT GAAAT GTT T GT G GAT GAAAGT GGGA 
AAG GAATT C ATTT GTAC T C ATAC G CATC C ATT GGAATTAAAGGTT TACAAGAAGTT GAT G CAAC AGT AC AG GAACAG GAG 
GTAGAAATAG CAAAT CTAAAAT CACAAATAGCTAGT CAAGAAGAT C G GATAG CAC GATT AGAAGAATT ATT ACT AC AAC A 
AT TAAT AAAT AAGAAAC CAGAG CAGC CATAGGCT GGT CTTT TTATTTT G GC CAAAAAGGAG AGGAAAAG AT GGAT C GT AT 
TGATGTATTACTAAAAGCATTTATAGCTGCGTTTGGTGGCTTCTGTGGGtATTTCTTGGGAGGATGGGATGCAACATTGA 

AAATCTTAGT GACAAT G GTAGT TATT GATTAT TTAACT G GCAT GAT T G CAG CAG G GTATAAC G GAG AATT AAAAAGC AAA 
GTTGGTTTCAAAGGCATCGCCAAAAAGGTGGTGCTTTTTCTTTTGGTCGGAGCGGCCGCTCAACTAGACTCGGCACTTGG 
AAGCAACAGTGCAATCCGTGAAGCAACAATTTTCTTCTTCATGGGTAATGAATTACTTTCACTCTTAGAAAATGCCGGGC 
GAATGGGTATTCCACTCCCACAAGCATTAACAAATGCAGTTGAGATTTTAGGTGGTAAACAAAAACAAGAAGAGAAAAAA 
G GAGAT GTTC AGTAAT G GAAAT CCAAAAAAAAT TAGTT G AT C CAAGTAAGTAT GGT ACAAAGT G T CC GTAT AC AAT GAAG 
C CTAAATATAT CACT GT T C ACAACACATAT AAT GATGCTC C AG CT GAAAATGAAG T GAGT TACAT GATT AGT AAC AAT AA 
TGAGGTGTCGTTTCATATTGCAGTAGATGACAAGAAAGCGATTCAAGGTATTCCGTTGGAACGTAATGCATGGGCTTGCG 

GAGAC G G CAAT G GT T C G G G GAAT C GT CAAT CCATTT CT GT AGAAAT CT GTTATT CAAAAT CAG GAG GAGATAGATACTAT 
AAAGCTGAGGATAATGCTGTTGATGTTGTACGACAACTTATGTCTATGTACAATATTCCGATTGAAAATGTTCGAACTCA 
TCAATCCTGGTCAGGTAAATATTGTCCGCATAGAATGTTAGCTGAGGG7VAGGTGGGGAGCATTCATTCAGAAGGTTAAGA 
ATGGGAATGTGGCGACTACTTCACCAACAAAACAAAACATCATCCAATCAGGGGCTTTCTCACCGTATGAAACCCCTGAT 
GTTATGGGAGCATTAACGTCACTTAAAATGACAGCTGATTTTATCTTACAATCGGATGGATTAACTTATTTTATTTCCAA 
AC C GACT T CAGAT G CACAACTAAAAGCAAT GAAAGAATAC CT T GAC C GTAAAG GTT G GT G GT AT GAAGTTAAATAAAAC A 
AAAGAATAGTTTTATGAACAAAAATAAGAGCCGTCCTGTTGGGCGGCTTTTTTTTATTGCTCAATTACTGTTGCACTAAT 
TTTAGGCATTCCTGTTTTATCTTTTTCGTCGTAGGCGCCATAGATTGTTACTATTGATCCTTTAGATATTTTTAATCCGT 
TTTTAAGTGTTATTTCATTTTCGTTCGTTTGCACTCCACTTTGGACAATTTGAATAGTGTACATGCCTTTGCCGTCATTT 
TCGTTTGTGCTTATGACAAATGAAGGTAATGCTGAAGACTTAAGTAATAAATCTACCGTTCCGGTAGCTTTAAGCCTTTT 
TCCTTTTTCGTATTGATCTCCATTTGCCTTAACAAAACTAACTTCTTCAGCATCTTGCTTTATCTTCTTATTTAACTCAT 
CCTGAGATGTTAAATCTTTTTTAGTTTCTGGTTGAGATTTGACGTTCGTTTTTTCACTTGATTCACTTTGTTTAGAAGAA 
TCACAAGCTGTTAGACCTAACAATAAGGTACTTCCAATGCAAATACTTATAAGTTTTTTATACATTTTCATTCTCCTCCT 
CTATCCAAATTTCTTCCATGTGCAATTTTAATTCCTTTGCAATTTTATAGGCTGTAAGAAAGGTAGGTAGCGTCGTGTTA 
TTAACGAGTGAACTCATTGTAGTTTGACTAATTCCAATAAGTTTTGAAAACTCCTTTTGACGTATTTCTCTTTCAGCAAA 
AAT AACAC GAAGTT T ACAT T TTAAT C G C ACAATAT CAC CT CTT TAAT T ATATACAAT T C G CAT AT GGAAAT GTGTCCTCC 
TTTAATTTAATCAACGAACATTTAGAAAAGTTTAAATGGACAGGCAATATAACTCTTTCTAAGTCATATACCTATATCAA 
GACCACGAGGAAT AC CAAGT G GAACTAAGGAC AT CAAGAGGGGAGAGGAT T AC AT G CGT T G G C AG TAT AAT CACTT GAAT 
ACAACT C CAT AT CT T CAT C CAT C CAAAGAATT AT GTT C AAT GTACAAT G GAT C GAGAT CAAGAG C AGAG AC G GAAT CAAT 
TT T AAAT CACAT GAAAAAT CAT GAAGTTTAT GAT C GAAAAGAAT ATAAAG GATATT T C AGT TT GT CAC AG GT ATTAGAAG 
AAGAT CT AT AT GGAGAG GAAGAAGAT GTT TTAAACT G G GAAATT CTAAT G G ATT GT TAT GAT GTAGT T CTTAC AAGAAAA 
GGT AT T G C ATTT CGT GAAAAAGAAGAGGAG GAACAAG CAT GACT CTT G CT G GAGAAGC GAT T ATT ATT T G GAC GG CAAC A 
GGGTTGTCAGTAGTTGCAATGAAGGCAGCAGAAAAAATGGGGAAAAGTGTTCCACATTGGCTTCCACGTGTCACTTTGTA 
CACAACACTTACAGGCTCGTTTCTATACCTTCTACGTTATGTTCTCGTTTTATTTCTATGAAGGAATACGATGTGGAAAC 
TTTTCATTCCTTATGTCA.TAAGGAGTTTAGCTTGTATGCACGTATTCCTTGAAACAGGGATATATACCCTCTATAAGAGG 
GA.TATAAGGAGTGATTTTATGCTGGAGTTGTTATCAGTACCATTCGCAGGTTTAATTTTCGCCATAGTTGGCGAAAGGCT 
CAAAG GAAG AG AG AGT GAT CGAAAGAAAAT ACAAGTTTT TTTT GAAGTAAG CGGAATT G C GAT AC GT AGAG AG GACAAAT 
T ACAGT AT C C AGTT TTT CTT GAAC AAAAAG AGGAT GAC C GAAGT ACAACTT AT AT ATAT C G GTTGC CT GTAGGAAT GC C G 
AGT AAAATTA-TTCAGAAGGT CG AGGAT GTT GTCTCT GAAGGGCTAAGTAAACCT GTCCGAATT GATTATGATAATTACAA 
GCTAAATATTCGTGTGTTTCATAGGGATATACCGAAAAAATGGTCATGGTCTAAAGGTTTGGTTGCAGAAGGAAGCTGGT 
GT GTT C CAAT GGGC CAAAGTTT AG AAAAACT T ATCT AT CAT GATT TT GATAAAACACCACATAT GACACTAG GT G GT CT G 
AC AC G GAT G GGAAAAAC GGT AT TTT T AAAAAAT GT AGTTACTT CT CTT ACT TT AG CACAAC CAGAAC AT ATTAATTTAT A 
CATTATTGATTTAAAAGGGGGCTTGGAGTTTGGGCCGTATAAGAATTTAAAACAGGTAGTTTCTATTGCT GAAAAGCCCG 
CAGAAG CTTTTAT GAT ATT AAC TAAT AT C CT C AAGAAGAT GG AAG AGAAAAT GGAAT AT AT GAAAT GT AGACAT TAT AC G 
AAT GTTGTAGAAACAAATAT C AAAGAG C GTT ACTT C AT AAT AGTAGAC GAAGGAGCCGAACT T T GCC CAGAT AAAAGT AT 
GAAAAAAGAACAGCAAAGGTTATTAGGAGCGTGTCAACAAATGCTCTCTCATATAGCGCGCATAGGTGGTGCTTTAGGTT 
TTAGATTGATTTTTT GTACAC AGT ACCCGACAGGGGAT ACATT ACCGCGCCAAGT AAAACAAAATAGT GAT GCGAAATT A 
GGCTTTAGATTACCGACTCAAACAGCATCAAGTGTTGTTATAGATGAAGCGGGATTAGAAACGATAAAAAGCATTCCCGG 
AC GC G C GATT TT CAAAAC C GAT AGACTT AC AGAAAT AC AAGT GCCTTACAT T AGTAAT GAGAT GAT GT G GG AGCATTT AA 
AAGGATAT GAGGT GGAGAAACATGAGGAT GCAAACGCATAT GCAAAT CAACCGTCAAAT GGCGAT ACTTGCGACGATTAG 
AAAG CT ACAGT T T GCAACGAGAAG G CATTTAAT GAGT ATT C AT GAAAT G GGT GGAAT AAG AAAT GCAAAT CGAATT CT GA 
AAGATT T AT CTATTTAT ACAAGTAAG GTAGTTTACAATAAAG AGC AT GT AT ATT ATT T AAACCAAT CAGGACAT AAGTT G 
TTTGGCGAAGGGAAAGTTGTACATCATGGTAAAGTTACACACGCTCTTTTACGTAATGAAGCTTGGTTAAATTTATATTG 

T C CTGATGATT G G CAAGTAGAAACT GAAAT T AAAT AT AT AAAG GATAATAAAlAAGAAAAAAAT AAT T CC AG AT GT GAAAT 
T T C GT GAT GAG GAC AG AAT AC TT CAT G CT GTAGAAATAGAT C GTACT C AGAAAAT GAT AGT GAAC GAT GAAAAAT T AAAA 
AAAT AT GAG GAGTTAAC G C AG AT TT ATAAAC AGAAG CAT AAC G G GAAAGT GC CAGTT ATT C ATTT CTTT ACAAT CACAAA 
ATATAGAGAAAAGAAATTAGAAGAACT GGCAAATAAATATAATGT GTTTGTAAAAGTAT ATGTAAT CGCTACTACTTAAT 
GATGAAAAAAAGAGCTGATCATTTTCGAATGATTAGCTCTTTTTTATGTATTGTATTACGTCGTCTATTTTGTAAATTTT 
ATTAATT CCTTTT T CT GCAG CAAT G GCAT T TAAAGCAT CAAT GATAG CTT CAAG C GAAT CAAAAC GAAC AG CAT T AGC AT 
TAC CATT CACTAAAT C ACTAAT C GT GT T GT AT CTTACTT GG GATTC T GT AGATAATTTATT TTT AG T GAT C C CC AATT C A 
TCTAAAGAATTTCCGAGTGTGAATTTCATTTTATTCTCCTCCGCAGCACTGGTTATCTTGTACTCATTTTACAACATCAA 

T C GAAATTAGTAAAACTTTT TT C GT T CAACT AT T GAC GTT GAATAATT AGAGAGTTATAAT T CAAC T TAAAT AGTT GAAC 
TAAT T TAGT T GAACT TAAAAG GAG GAAC AAT TAT GAAT C GAG TAAAT GATTAT TT T G GT TTAGAAAGTAAAT CAGATT GC 
ATTTGGTTTTATGGTTTCTTCAGTATATCTAC GATTTTATTTTTAATCGATATGATTATTGCTCTTATATAAGGAGGGGA 
GAAAATGCTT AG CTCAG CAAACTATAC G CAAT AT AAAAAAT TAC AAT CATT C CGAT CAGTAGAA GAGAT GAAT GAAG CG A 
TTT GTTCTTTT TTATACAAAC AT ACACAT GAATTAT C C GAAT CAG CAATAAAAGTATT GAAATTT CT AG CAAG GCAC TCT 
TGTAAAATCCCAGGTGTCTCTTTCTTGAAGGTAGGGACAATTGCGGAGGCATTAAATATAAGTGATCGAACTGTTCGCAG 

G GTAC T AAAAG TAT TA GAG GAT TTT GAAGTAG TAAC T AGACAT AAAACAATT C GAAC G GAAG GAAAAT T AC GT G GAG GGA 
ACGG.ACATAACGTCTATGTCCTTCTAAAAAAATATAGTGTCACACCGAATGTCCTACCGAAAATGTCACAGCGACAAGAT 
GAAGAAAACCTTACAGAATCTVAAGGTTTCAGATACAAAAAC^GACAAGGAAGCTAAACTTTCTGAATCACACCCTCTAGA 
AGAATTGAAAAGCGAATTAAACGTAAAAGAAACGTCAGCAAGGGAATCTA7VAGAAATCGAATTAGAGGATCTAGATGAAA 
CTTTTACACCAGAAAATGTACCAAGCCAATTCAGAGATGTGGTAGCTCCATTCTTCAAATCAGCAGATAAAATTTATAAA 
TT GTAT CAT C GAGTAT TAAT AG CTTATAAAC GTTCAAAAAT AGACAAG C CTAT T GAACAAGT GATAAAT CAAG C CATT CA 
AGCATTCAAAGAAACTGTCTTCGCAGAAAAAGCAAATAAAATTAGAAGTACTTTTGAAGGTTATTTTTATAGAATTGTTG 
AAAGTAAATT T GTAAT G G AG AGAAG G AAAG AAT GT C G A G GATTAT TGTTC GAT T G G T TAAAT GAAT AAT ATAAAAT T G C C 
CACAGGGAAAAATATATATATAATTTAATTATCATATT CTTAGTAAATAAGTG GGT GAAAATTTTGAAATAC G CTGTTTA 
TGTACGAGTTTCAACGGATAGAGATGAGCAAGTTTCATCTGTTGAAAATCAGATTGATATTTGTCGATATTGGTTAGAAA 
AAAACGGATATGAGTGGGATCCAAATGCAGTATATTTTGACGATGGTATTTCTGGTACAGCTTGGTTAGAACGTCATGCG 
ATGCAACTAATATTAGAAAAAGCAAGACGAAATGAATTGGATACAGTCGTATTTAAATCTATACACCGTTTAGCAAGGGA 
TCTAAGGGATGCCTTAGAAATTAAAGAAATTCTAATAGGTCATGGGATACGCTTGGTTACAATTGAAGAAAATTACGATA 
GTTTATATGAAGGTGGCAATGATATTAAATTCGAAATGTTTGCCATGTTTGCTGCACAATTACCTAAAACTATATCTGTA 
T CTGTTT C T G CT G CAAT G CAAG CTAAAGCAAGAAG AG G C G AGTTTAT T GG AAAAC C G GGATTAG GAT AC GAT G TAATT GA 
CAAGAAACTT GT TAT C AATGAAAAGGAAG CT GAAATT GT AAG G GAAATTTT T GAT T TAT C C TAT AAAG G CTAT G G AT TTA 
AGAAAATAGCGAATATCCTAAACGATAAAGGCACATATACGAAGTTTGGCCAGTTATGGTCGCATACAACTGTAGGGAAG 
ATTTTAAAG AAC CAGAC GTAT AAAG GGAATTT G GT CTTAAATAGT TATAAAACAG TAAAAGTAGAT G G AAAG AAGAAAAG 
AGTTTAC ACT C C GAAAGAGAGAT TAACAATTATAGAAG AC CAT TAT CCAACAATT GTAT CAAAAGAATTAT G GAAT G CG G 
TAAAT AG C GAT AG G G C AAGTAAAAAGAAAACAAAACAAG ATACAAGAAAT GAATT TAGAG GAAT GAT G T T TT GTAAACAT 
T GT G GT GAG C C AATTAC AGCTAAGTATT CAG GT AGATAC GC AAAAG GAAGTAAAAAAGAGT GGGTAT ATAT GAAAT GCAG 
T AATTAT ATT AGATT CAAT C G CT G C GTT AACTTT G ACCC G GCT CATTAT GAT GATAT AAGAGAG GC GATTAT C TAT GGAT 
T GAAG CAG CAAG AAAAa GAACT AG AG AT ACATTT CAAT C C AAAAAT GCAT CAAAAAAGAAAT GATAAAT CT AC AGAAATT 
AAGAAGCAAATT AAGTT GTTAAAAGT GAAAAAAGAGAAGTT GATT GATTTATAC GTAGAAG GAT TAAT C GAT AAAGAAAT 
GTTTT CGAAGC GGGAT CT T AATTT C GAGAAT GAAATT AAAG AG CAAGAGTT G G CAT TACT TAAAT T AAC AG AT CAGAATA 
AGAGAAAT AAAGAAGAGAAAAAAATT AAAGAAG CTTTTT CAAT G CT CGAT GAAGAAAAAGAT AT G CAT GAGGTTTTTAAA 
ACTT T AAT AAAGAAAAT CAC ACTT AGT AAGGAT AAGT AT AT CGACAT CGAAT AT ACATT T T CTTT AT AGTTTTAAAGTT G 
GTTATT AGTTACT GTGATATTTAT CACGGTACCCAAT AAC CAAT GAAT ATTT GAT AAATT GAACATTTTTAGTAAACAAT 
ATTTTCT CAAT AT GAGAATT GCGCTTTACAGAACACAT GCT CT CATTAAT GTGAT AAAATATT CT GT AAATATAAT GGAA 
AAAGT GT T GCTT ATT GAAAT GAAGG GGGT AAGT T ACTT GAAAT TT CAT GAAAAAATT AT GG GGAT GAT T GAGGAT AGGGA 
T GACTT AACAGCTACT AGT GTAGCGT GTAAAATT G GCGTTT CAAAAC AAT ACAT GT CAAAATT CAAAAGACAAG GAACT A 
TT GGATT CT CT C AATTATT GAAGCT AG CACCT ATT TT GAGCGTTGAAG GAAAAAAAGCAAAG CAAACT AT GT C CGATT GG 
T GTT TAG AATT AGATAC CACAGAGT CTAT AAAACAAAGTTT T GAAT AT G C GT GT CT AACT C GT AAT ACAATTTT ATT GAA 
AC AATTAAT AC AAAAGC AT AGCAAAGAAACT GGAACAAT CCGAGAAT ATGTT GAAGTGT AT ACAAT CT T GTTT AAAT ATA 
TT AAGAAT AT AATT AAAGG CT C G GAAAT AACAAAG GAATT AAAG AAG AT T GGT G CT ATT AAAG AT AAG GTTTT AG AG AT A 
T TAACAAAG ATT AT G GAAT G CT AT GAAT ATT AT CAT CTAAAAAAATT CAATTTAAT GTT GGAAACT G CA.G AAACGAT T GA 
T T CACT G GTTAGAGAAAT T GAAG GAGAAC GAAAAT C C TT C ATTAAGGAAT GTT AC AATTAT C GT ATT GCT GAATT GTTTG 
CG C CGATT T T C CTACAAAAGAATAATGTAGATTTGGCTAGGAAGTAT G CCCACTT CTTAATT CAT G CTAAT GTTT GT AC A 
AAAACAGT CT CT GAC G C ATATTAC ATATTAG GTAT GT CAAAT GTATTAGAAAGTAAAGAACAAT GTT T GT TCAATTTAAA 
AAAGAGTTACTT GTTAAGTAAGGAAATTAGGGATGCT GAT ATT GAACAAGAGGCGAGATACAAT CTAGATGTT GCTAAAA 
TCTATTTTGGGGTAAAACTAGACGAAGACGCTGACAGTAGGTTATTACTGTACCAAAAAAACCCAACATGTGAATTGTCA 
ATTATAGCTCT C CAAG AT ATAATAAGAGACAGAGGAGACAAGGACTTTTTAAATT ATTT CATAGCAT GT T CTT C CGAT GA 
AAT C GAATGTTTATAC GATTT GTTTT AT CAAT ACTT CTAC CAAGCTAACTAT CT AT TTTC AGC GATAGTAGCAAAAGAAT 
T GTGTAATAGAG G G GATAAAT CT T T GTT GACT CAAT C GAT GGT TAATT TAG G GAAT GAAAAACAAAAAG G G GTT GTT GAT 
ATT GAAGAAATT AGTATTAG CAGTTTGT ACATT AT TAAC G GTT CTAAC AGT G GGATTGTCGTATAAT GAAAAT GTAC AGA 
T A.GATAAAAAAAT G CAAAT G GTT GAAATTAAAC CT G GT GG GTAAGGATAT TTT AAAGAG C GGATT AAAgAC C G CT C TTTT 
TT TGTT G GTAAACTAAAAT GAAAAAAATAAAAGTAATTT ACT T T CT GAAT TT T C C CTAGAGGAAAG GTTATAATT G GATT 
AT AG CAGTT GAG GG G GAAT AGAAAT G AAAAAAGAAAGTATT TTAAG GT CATTT T TAAAAT GT GTAATT T C ATG T T CAAAT 
AGT GAAT T T G AGT T CAAT C AAC TTATT GAAGTAGC GTTT AAT GTT GAAC AAAAAAT AAAAAAATAG C GAT GAAGTAACAC 
TACATCACTATTTTGAAGTACTTTTTAATTTTTCTAGACTTTCCATCATTGTTAGCATGTTTTGTAAGACAATTTCTTGA 
TGTTCCTCAGGTAGTTGTTCTAAACGGTTTTTTATGTGCAAGTACTTTTGATTCATATCCGCATCTAATTCACGAGAGTC 
AGAT CGT C G TAAGAGAT AAT CAACAGGTACACCAAGAAAATCAGCTGCAC GTT CAAC G GTTT CTCTA GAT GC AG GTTTAA 
ATCCAGTTTCAAACTTAGAAACGCTACCTGCAGTAACACCGATAGCTTGTCCAAGATCATGTTGTGTTAAATTCCGTTCT 
CTTCGTAATTGACGTAACCGATCTTTAAATTCCACAATAATCACCTCATAAGTGGTTTGTTAGGATTATTATAATATTTC 
CTAAAGGGAAAAT CAAT CC GAGTTATTT CTAAGAATAATATAAAATATGTGTAAAAATATATCTTGAATTTTCCCTAAGG 
GAAT GT TAAG GT GATTTACAAAGATATAGAAAG GAGT TAC CAC AT GAAAG TAATTAAAGAC GAGACAAAATTAAAAG C T G 
CATT CAAAAAAT C T GG GTATAAGTAT CAAG AGTTAGCT GAC GAATTAG AAATAT C CT GCAG CTACT GT T ACAAGCTAAT T 
AACAAT CAT AAT TACAAAAAGAAAATAT C GTAT AACT TAG CAT C CAGAAT G GC GC ATGTATTAAAT G C AAGTG TAGTT GA 
TTTGTTTGAAGAGCAAGTCGATTTTTTTTaATACCAATATTCCCTGAGGGAACATAGGGGTGAGAGGGCCATGTCAGAAA 

TTTATTACAAAGGGTTTATCATCAAGGAAACTTATGGCGAAAGA^^TATCGAAGAAGTGTTTAAAGAAGCATATGAGTCA 

TTTTATGGGGTTGAAGTTAAGGTtGTTTVAAAAGGAATTAGGGACTAAACGCAATAGTGCAGCCAGCTAATCTTTAAACTT 

CAGTGAGAACATTCAATGAAGTCGATTATAAAATGGACAAGCCTGAAAGGAGAGAAATGAATGAAAAACGGGAAAA 

GACTAAACGTGAAAAAATGCATCTTAAATCATATAGCTTAAATCCTGATAATTGGTTGGTTTTCAAGAAAGCGGATGGAG 

AAATGCATTTAGTACACCGTTATACTAGCACAACTCGTGTAATTCCAAGTTTATAAGTTTAGGAGGGAATAAGATGGATC 

AG T TAAC AGTAG CAAGT GAATT AC GT C TTTT AG G G AGAAGAAAAGTAGCT G GAT AT GAATTTACT G G AAT C GAG G GAG GA 

TT T GGT GAAG GTAAAAAAG C AAT GTT G GTT TT G G ATATAGCTACAATT C ATAAC CAAC CATTAAAAGAAAT C AAT C GT C G 

CATTAATGATAATCGCATTCGATTTAAAGATGGTGTGGATATTGTTGATTTGAAAAGTGGTGGCTTTAACCCACCACAAT 

TATTAAACCTTGGTTTCTCAAATATGCAGATAGCGAAATCAAaTAACATCTACCTT CTATCAGAACGAGGTTACGCAAAA 

CT ATTAAAAAT T CT C GAAGAT GATAAAG CT TG G GAATTATAC GACATATTAGTT G AT GAGT ACTT CAAC AT G AGAGAAAA 
GAATCAAGTGGCTACAGATCCAATGAGTATTTTAAAACTTACATTCGAAGCATTAGAAGGCCAGCAGCAAGCAATCGAAG 

AGATAAAGT C G GAT GT ACAAG AC TT GAGAGAAAATAC AC C ATTATTT GC AATT GAAT GT GAT GAAAT CT CT ACAG CT GT A 
AAACGTCAAGGAGTCATATTGTTAGGTGGAAAACAGTCTAATGCCTATCGAAATCGTGGATTAAGAGGGAAAGTTTATCG 
TGATATCTACAACCAACTATAC CGT GAATT C GGAGT GAAAAGTCACAAAGCAATTAAACGTT GT CACTT AAATGT AGCAG 
TAAAAATAGTTGAAGAATATACACTTC CAATT GTATT GAGCGAAGAGATTT CTTTT GTAAAT GCACAAAT GGATTTTACA 
GAAAT GTAGTTAGTTAAAAC ATT CTCAACCGGTTTTTTTCTAAGTTAAAAAT^ 

CCAGTTACGTGTTATTGAGGGAGAAAAAGTGGATAAGCCAGATTATGTTGAGATaTACCTTGGAGCATTTATGAATGCAG 
TTAAT GAGTTAAAGAAACAGGAT GAGGAAACGAGAT CATTAAGCAAGGATACGTAT AAAAAAGCAAT TTTT TAT GGAGTT 
AGATACAT TT CAATAT CAAAAAAT GACAGTTT GAAT TAT GACT ACCT AAT GAAT AGATTT CT TT T AAT AAG CT ATTT AG A 
AAATTT GAT GAAG GT GTT GACGCCTAGGGATTTTATGACCATATTCCC7\ATCGAT7\AAAATTAT GAT GGCGCT CGT TAT G 

AAAT GAAAGATTACTTTTTT AC CAT GAAT GAAAT TAAAAAAAT C G GAAT G GAT AC ACCT ATT G G AG AG AAAAT C AT GG AG 
TTTTTATGGGATTACC7\AAACTTTAAAGATATAACACTATTTAACTTAGCCTCTGTAAGCATTTTAAA.TAAATTGCAGAA 

AAT G C AAG GTAAAAAAACGTT AACT GAAGAGTTT GC C G AGCGAT TAG GT AT CGAT ACTT AC AC GAAG C AT AAAGAAAAGG 
GT GGAAAAGAAt ATATTACAAAT G ACC GT ACT GGT GAGAT C CAAGAAG t T AAAAAAT CT AGAC CAAG ATATT T AAAAC CA 
GTT CAAT GATT G AT GTT ATT AAG GCTT AT AAACAAAGAAAGT AACTT GC GC CAACAAGT T ACTAAATAAAAATA CT TAT A 
AAAAT AT ACTT AT T AGAAAT AT AAC AT AC ACACT CG AT GT AT G GAAAGGGT GTT AT TAT G GCT CTTTT T AGAAAAGT G C A 
TACAGAATTTT GGACAGACGTAAAAGTATCAGAAGAT AT GACGCCAGAAGACAAATT GTTTAT GGTGTACCTTTTAACTA 
AT C CC CAT ACAACT CAATTGGGAGTAT AT GAAAT C AC ACCT AAG ATG AT AG CTTTT GAAAT CGGACTATCAATAGAGT CG 
GCTAGAGC ACT AT T GGAACGT T TT GAAAACCAT C AT AAAT T AATT AAAT AT AAC AAACT GACAAGAGAAATT G CTAT AAA 
AAATT GGGG CAAAT ACAACCT GAAT AG AGG CG GGAAAC CAAT T GAAGATT GT CTTAAAAGAGAAAT T GAT AAAGT GAAAG 
ATTT AT CT CTAAT AAAATT C ATTTT AGAAC AT AC AG ATC AT G CAGCTT T AAAAAGAAAAAT CAAT C T TT AT G C G GGTTTT 
GACGATACGTCCCACGATACGTTAGCGATACGTGACCAAGAAGAAGAAAAAGAACAAAAAAAAGA 

AGAAGAAAAAG AAAAAG AAAAAG AAAAAC AAAAAGAAG AAGAAAAAGAAC CAGAAGAAGAAAAAACAAGAAT AAAAT C CA 
AAGCGT CTT TAAAATCAGACGCAAAGTC CAATCCAATACCGTATAAAGAT ATATT GGATTACTT GAAT GAAAAAGCAAAT 
AAAAATTT CAAT C CT AAAGC AG AAGG AC AT AGAAAGTTAAT T C G CG CTAGAT G GAAT GAG G GGT AT AAACT AG AG G ACTT 
T AAAAAAGT TAT C GAT AACAAAACTAC G CAAT GGT TT GGTAAGAAAAGTT T T GAT G GAAAAC C ACT AGAT CAATTT T T AA 
G AC CG AGC AC GTT ATT T GCACAAAAAC AT TTT GACAACT ACTTAAAT GAAAC GGT CAACAT AT CC AAT C AAC AACAT GGA 
GAT CAGATT GTT AT ACCT GG ATTTAG GG G GG AAAT G C CGTTT TAGAAAG GAGT ACT AAAT GT GAAAAAGATAC AAG ATT C 
TTTTGAAAAACTTACTAAGTTAAAATTTG CAGAT GAACAAT GT GATAAGCACACCTTTAATAAACATGGGAAAGAAGTTA 
TT AAATT AGTTAG GAAAAT GATT GAT G AT GC AG GAAC G GTATATT GT CCC C GCT GC AT G GT T GAAGAG C AAAAT T C AGTT 
TT ATT T C AACAAGCAAATAAT CAT TATAAAAAGATTAATAGAGAACGGAAGAAAAAT GTACT CT T T CAAC AC AG CAT CAT 
AGAAAAT CAAT CC ATT ACAG AAT CAAGATTGT CTACATACAAGACGGATT GT CAAGAAACGAAAGAAAACAAAGAAAAAG 
CTAT AAAAATT CTT GAAC G C ATAAAAAAC GGT GAGTT TTTAAAT GTAT ACAT T GCAGGGAT T CAAG GAG TAG GAAAAAG C 
CATTTAGC GTAT GCGATGCT GTAT G AATT AGTTAAAC ACT ATT GGGTAATAT C AGAC GGTGAGAAATT AAAT GAC GAACA 
T G CTT T TAAAAATAT GAAAAG CT G CTT ATTT GTAGAGATT GAAAAGCTAATT C GAT TAAT ACAG CAC T CTT TT AGAAAT A 
TAGAGT CAAAATATACAAT GGATTATT GTAT CAGTTTAAT GGT AGAT GT GGATTT C CTT GT AAT CGAT GATTTAGGAGCT 
GAAAGT G GT T C GAT GAAT C GAAAC G GAGAAG CAAGC GATTT T GTT CATAAAATACTTTAT G GT GTT ACAAAT G G AC GG C A 
AG GAG CAAATAAAACAA CAATT AC AACTT CAAAT CT G T CAAG C G CT CAAT TATTT C AAAAATAC GAT C C GAAACT AGC AA 
GTAGATT GT TAAAC GGT GTAT C GAAAGAT GAAACAAT T GTTTTTAAAa C AAC CACT GACAAAC GAATT GT AAATTT AGAC 
ATT G GATT CTAATAAAAG G G GT G C GGAGAAAT GAAAGAG GT AAAG G G GAAAAAC AC CAAATTAAT G GAAGAATTT GAC GT 
GTTATTAAGACAACTGCT GATTAAATCTAAAACAGATGAAAGGGTAAAAAACTTTTTGGATGATCT GTTT GAAAT GCT AA 

GT GAT AATAAG CT GC AG TCT GAT ATT GATTT CAAAACAGCATTAAATAAGTTAAGAGAAAAGCACTTTCCTAAGTTT GAT 
AAAGGAGAGAGCAAAAATGACTAAAGAAAAGGGACAAGCTAAGGAAGTAGTTAATGTTCGTGGAATGT CAGAT GATGAGT 
T TATAGAGAAATA C GGAAG G C TT GTAGAT CAT T GC GTAT GGAAAAGAT AT G C GAAAAAAAAGG C C AGT AT AGAGCGT GAT 
AC C G GT TT AGAT AT T GAG G ATT TAAC ACAATT CGGAAT GAT C GGTTT G AT AAAGG C G C GAG ATAAT TTT G AC CTT GAATT 
T G GAT G T G C GT TTT CAAC GTAT GCTGTTCC GAAAATTAT T G G G GAAAT AG GAAG G G CAATT C G GGAT AAC CAAAAAAT AA 
AAGTTCAAAGAACCGTATATGGCGTAAAAGGAAAGATTTTAAATCAACAGTTAGCAGATAAAGT^ACCAGAAGAAATAGCA 
GACATTTTGGATGAGTCAGTATCTTTAGTAAAGACGGCTTTAGAGTATCAACCAAGCACAGATTCACTCAATAAGGTTGT 
ATATG CATCT GGAGCTAAT GAAGAACT GACATTAGAAAGAAT GATAGAGGATACTAAAACGGAAGACATT GAAGAAACAA 

CCATTAATCGAGCTGTGATAAGAGAATTTAAAGCTGCATTGCCTCCTAAAGAATATATCGTTTTAGATATGCGTTTACAA 
AATAT G AC G C AACAAAAC ATTGCAAAT CAAAT GG GATACAGT CAG GTACAAATTAGC CGTATAT TAG CAAAGATT AAT CA 

AAGAGCTGCTCAATTTGGTAAAGAAGGAGGGCTTCAAGATTGAGTGTTACAAAAGGTGTTTGTATCGATGTAGATCACTC 

AGATTTGCTACATGAGAAAGTAGAGTACTTTTTATTCCCTGCTAAACCAAGTCATTACTATGTAAGCAGATTTAATCGTA 

AAGGAGCGCATTTTGGTTGTTATCAAGCTGAAAGGTTTCAAATCACGGAAAAGGAAGTATGGACACCAGAACCTCAACCG 

AATCTGCCTGAGTTGAATACAAGCTTATTCTATAGAGCTCAGTTGATTTGGCGAAAAAAGGGGTATAAAGATAAACCACT 

TAAAGACTACATCGTACAGCCGAGAGGGAAACATTGCTACTTTTGGCATGATCGGGAGCGAAAGAAATTTTGTGGCTGTT 

TTCCGCTACATTGGTTTACCGATTTTGTACCAGTTCAAAGTCATCATATAGAAGAAAAAACTAGAGAAGAGGTTAAGTTA 

TTACAACGGCCAGATGGACAACTTGCATTTTTTTAACGAAAGAAAGTGAATGGGCGTTTTACCCAGTCATCGATTTAAAA 

AAAGGAGTGTTCGTAATGGATATTAAAAAGTTATTTGCAATGCAGAACATTTTGGATAAAAGAGTTTTAGAGTCAAAAAA 

TCTTTCTAGAGGAGAAGTATTCGAATTTAGAATACTAGCGTTTTTAGATGAATTAGGCGAATGCATGAAGGAATGGCGAG 

TAT TTAAGTTTT G GAG C G AC GATCGTAAAC C GAGAACTAG C ATAC CTACAG G G GAAATCATAGTAC TAGAT GAT GGT TAT 

GAAGTAGAAGTTTATAAAAACCCTTTACTTGAGGAATATGTGGACGGACTACATTTTGCAATTGGACTTTGCATAGATTT 

GAAAACAGAAATTAACTTTCCTGCTTCTATGCGTTGCGAGACAGTTACAGAGCAATTTTTCGAATTGTATCATCTAGCAA 

TACGATTAAAAGAAGAACCGACAGCATTTAGGGCAGATGTTCTTTTATCCCATTATCTTGGTTTAGGGGAATTGTTGTGC 

TTTT C GTTAGAAGAAATT G G AC AT GAGTAC ATT GAGAAAAACAAAAT CAAT CATGAACGT C AAAGTAAT G GATACTAATA 

C AATT T G AATTT T GTTAAGAAAAAGT GAGTGAGAGAT G GAAC TATT AT GAACT ATAGAATT CC AATAT T G G GAAT CT ATA 

TT AAT T AT AT AATTT AAAAAT GTGGTAAT G GTTAAGATTTT AATATAG GGAATTTAT GAAGT G TTAGTAT GAT T T G AT TG 

GCT GT CTTTAACTTTTT ATT AGT AATT T CAT AT ATT GTAG G GT G CAAT ATT GAAGAAGTAT G GG G G G GAGAAAAT G GATT 

GTTTTAAAAAAGGTAAATTTA.TACCATTTCCATGTGCTTTACCAATTCCTGAAGCTGGTCCTACTGGCCCAACTGGTCCA 

CCT GGAT CAG CT G GAG GCTC GACC GGT CCAACT GGT C CAAC C G GCC CG CAG GGTTTACAAG GGATT CAAG G G GT T C AAG G 

GAAT C CAGGAACT ACT GGACCT CAAG GAAT T CAAG GAAT T C AAGGAATT C CAG GGGTTT C AGGT C CTATT GGT C CTAT T G 

GTCCTACTGGAATCCAAGGAGTTCAAGGCATTCAAGGATTTCCTGGCATTCCAGGTCCTATGGGCCCGATAGGACTAACC 

G GT CCGACT GGT AT CCAAG GTATTCAAG GGATT CAGG GAGTT CAAGGT AT CCAAGGTATT CAAG GGGAT GTAG GCC CAAC 

TGGCCCTCAGGGAATTCCGGGTATTCCAGGATTAACTGGCCCAACTGGCTCTCAAGGTGTTACTGGAGTTACTGGCCCAT 

CCGGAGGCCCACCAGGTCCAACTGGTGCAACAGGTCCAACCGGTCCAGCTGGAGGCCCACCAGGTCCAACAGGTCCAACC

GGTCCAGCTGGAGGTCCAACAGGATTAACTGGCCCGACTGGCCCGACTGGTCCAACAGGAATTCAAGGTATTCAAGGGGT

ACAGGGTACTCAAGGTATTCCGGGTCCAACTGGTCCACAAGGGATCCAAGGAGTTCAAGGACTTCAAGGAATACCAGGCA

TTCCAGGTTCTATGGGCCCAACAGGACTAACTGGTCCGACTGGGCTTCAAGGTATTCAAGGGATTCAGGGGAATCCAGGT 

CCGACTGGTCCCTTTGGCCCGACTGGCCCGACCGGGCTTCAAGGTATTCAAGGCTTACAGGGTATTCAAGGTATTCCAGG 

ttCCAACAGGACCTCAAGGAATCCAAGGTCCAACAGGACCTGCTAGCACACTTTCCACAAAAGCTATTCTTTTTtGGGGG 

TACTAATTCAGGGTTTCAACGTATAGCTGGATCACCGGGTGCAGATTCACAAGACATTCCTTATGTACTTGGCGGAGCTG

GTAGTGTTGTAGGTCTTTCTGCTTCTATAAGTATTAATAATTTACCAATAGGAGTATATACAATACGAGTATGTAAAAAT

GTTCCTATTAATCTTGCTGCTCCGGGGCCTGGCCAAGTAATATCTACAATTATTCTTACAACTACAGCAGTGATTAGTGG

CACTATTATATTGACTATTAATCCTTCTGATATTGGTGCACAACCTGTAAGAGTATTTAACCCTAATTTAGTTATAGCAC 

CTGCTACAGTTGCTTGGAGCAGTACAATACCTGGTGACATAGTTGCAAGAGGTGATGCAATGTCACTTTTTATAACTCCA 

GGTATTACGCAAAATGCTGTGTATACAGTATTCTTGCATACAGGAAATTAAAGTTTATTTTATGTGAATTTAAGTCCTGT 

AAATTGGAATGAAAAATTAAGATATGTATCGGAGTCTTTTTATGTACAAAAGAATAAGAGATTTCTTCTGAACATCTAAA 

AGGAATCTCTTATTCTTAATCGATAAATTAGGTTTTAGAAAAATGAAAAGATTTTGTATGAAAATAAATAAAAGAATCCA

TTCGTTACAAACGGATTCTTCCCACAAGGTGTGTAAGAAATTCAAGATAACTCGACCAGAGCATCATGTAGAATTTCTTG

TGATATTAATGTATTCAAAGACATCCAAAAGATGAATGGCAATTAAATAAAATCTTTATTTGAAAATTAAAGATTGCTTT

TGTTAGGGTCTCTATGACTAAGAGTTATCTTAATTTTTTAGGTTTATGAAGTATTTGAGTAATAATTTAGTTTCAGACAA

AGGTGATGTTTAGTAGTAATACCGGTTTGCTTCAGTAGACATTGCAATTGCTTTTGTTTCATGAACTGTACCATATGGAT

GTTCTGGAGGTGCATATATAGCGTAAATTTTAAGTGGTGTATTTCCTGTATTGATTACATTATGCCATTTTCCAGCAGGT 

ATCATAATTGCATAGTCATCATAGACCATTTCTTGAAAATCTAATTTATCTTTGTTATCACCCATTTGAACGAGTCCTTG 

GCCCTCTTC7\ATACGTATGAATTGATCAGTTGTAGGGTGTACTTCTAAACCTATGTCATCTCCAACATTAATACTCATTA 

AAGTTACTTGTAAGTTTTTTCCTGTCCAGATAGCGGTGCGGTAAGTATTGTTTTGTTTAGTGGCTTGGTTAATATTCAAT 

ACAAATGGTCTAGCTCCATAATCTGTTAATCTAACATTTTCACAATAAGGATTCCGGTTGCGGTTCCAAGCATTATTGTT

GTAATTGTAATAATAAGGATTCCAAGCGTAAATCCA^TTATTGTTATTCCAGATGCTATCCATTGGGCTTtGAGATTGAT 

AATAATAACGTGGAATATGTTGCATATCCAAGCTCCTCTCATAATTGtATCATTTACTTTTTATCCTATGCTGTTGTCTA

TTTATAGGAATGCAGAATAAGGGGAAATGGGCAGTAATAAAAAATATAAAAAAACGTTTTTATTTTTTCAGGAAAAATAA

AAGTAACAAGTTAATAAGGGATGTACTACTGGTATAAAAAACTTAATAAAATAGTTATTTGAATTAAAAAGAGCGCCGTT 

GGAGAGTGCGGTGCTCTTAGACCAAGAACTATAACAGGGATTAAGGAAAGAATATTGTATACCAAATTGATAGTAATGCA

AGCCATCCAATTGTCAGCGCTATGTATTTTAAAATTTTCATGATTACTCCTTTTAGGTATAGAGTGCACCAAGCAAGAGG 
ATGTTATTAATTTTTAAACAAAATGCTTATTTAAAAACTAAAGA^^

G7^TACCTCATGATACTGTATGTATGTTTTTTTAGGAATGTGAGGATTTAAAACAAAATCGTTATTTTATAGATCGGAGT 
GAAATTCAAATGATTGTTAAAGCGACAATAAAACTTGAATTAGATGATTCGCAGAAAAATTGGGTTTCTTATGTTAGAGA 
ACAAGGTGGAGAAGAAGCGGTATTTCATTATCTGGAAGAAGAAGTGCAGAAGAAAATTGAATTAGCTGATTTTGTGGAGA 
TGAAATACAAAAATAAGTAATTTAAACCAAAACGCTATTTTATAAAAT7\AAACAGCTAGCGTGATTAGCTAGCTGTCCTG 
TTAAGAAAAGAAAACGGTGTTTAGCAAATGTTGCTGTTGTAATTGCGAATTACAACCATAGTATGAGCAGAAGTAAAAAT 

GTTATGCAAGAAAGTTAAATAAAAACTGCATTTTATTGAAAAGGGGGAATGGATATGTCTCTAGTAGGGAATTTAAAGGA 
ACTCCAAGAAAAAGCCATCGATGAAAAGGTATTGGAATTTGCGGAAGAAATGGAAATCGTAATAACTAAAAGTGCCGCAA
GCGGATATTCAGGTCATAGATATAAGATTCATAATGAAAATCCAAATCGGCATATGATGTGTTCAAAAATATTTATAGAA
AAGTTACAAGAATTACTGGACGGTGTGAAGGTTGAATTTAAGGAAGAAGAAAAGAAAAATATTTTAGGCGGATCTTACTA 
CGAACATTACATCCGTTTTAAGTGGA^.TGACTAATTTCTTATTAAAAATTTTATTTTGGAGAAAGGGAGTAGAAAGAATG 
AAAACTTTTAATGTGACTTTTACAGAGTTGAAAATATATGAAGCAGTCATTGAAGCGGAGTCAGCGGAAAAGATTATTGA
TGTGATTAAACACTTAAAAAGAACTGAAGATGATTTAGTAGACAAAGGAGTCATCATAAACGAAGTTAGTGAGATAAATG

TTAGTAAAGAACAAAAGTTCGAATAAATCAACTTCTCAGATTGTTTATTTTGAGACGGAAACAACTTTCTGAATATCATA 
AGACCTTATTAGCGAAAAAaCTCTTATTCGAGCGTACAAGCCTGTTATACACGTTGCACGGAAATTAGAATGAATTTGTT
AAGGAAGGAAGTATAAAAATGAGGGCTTGGAAGAAAAAACATGTTAAAAGAGCATTTTTGAATCGTCAAAAGGAAATTGA

TAT^G^ACGGACTGCTGCAGCTTGGAGAAATATTTTTGTGAAATCAGGAATCATAAAATAAAAAAGGAAAAGCAACTCGT 
TGGGGACAAGTCACTTTTCCAGATGGCAATGTAAATCCATTATAGCAAAACaTATGTACaAGCTGTAGCAATAAACAACG
AGATATTTTGACACCTATCGACAATTAGAAATGTGGTTGTTGATCTAGAAATATGAAAGTAGGTGAATCATCATTTGTTT
AACTGGCTGAGAGATTACCAAAAGTTAGAAGAAGACATAGCCTATCTGGAATACAACTTAGATAAGACAAAAGCTGAATT
AAGACGCTGGGTGAGTGGTGATTTGAGAGAAGTACGTTTAACGGCAGAATCTGAAGGTGCAAAAGTTGAAAACCGCATTG
AAGCGATTGAATACGAATTAGCACATAAGATGAACGATATGTATAAATTAAAAAAGTTAATTAGTAAGTTTAGAGGTTTA
GAAAATCAGATACTCAAATTAAAATATGTGGATGGTATGACGTTAGAAGAAATAGCAGAGGCAGTAAATTATAGTTCTAG
TCATATCAAAAAGAAACATGCTGAACTCGTTAGATTAATTAAGTTCGTGGAGCGAGAAGGTGTCATTTAGGTTCACTCCT 
AAAATGAATCGAAACGGTTGAAAAAATGATTTATATTGATAGCATACAATTTTAGCAGAAGGGCAACTGGTGCACGGTTG
CTCTTTTTGATTTTGGAGGTTATTAGACGATGGATGTACAAGAGTTGTCGAGACGATTAGAAAATCTAGAACATAAAGTG
CTTCAGGTAGAAACGAAGGCAGATGTGCTAAACCGAACAGCTATACAAAAAGGCGATAAAATAAAAGTGGTGTATCCGCA
TTTAGGGATACAAGGCGAGTATTTAGTGGAGAAAATTGATAATGGTGTGTTGGAATTGGTAGCAGAAGAAACAATGAAAA
AAATACAGGAGTGATTAGGATTGAAGAAGTTATCTAAACAAGAGCTAGCAGCTGTAATGACACATTGTATTTCAACGCTT
GGTGAGCAGATTGTTAATGAGCATATTAATCCCCAGAAGTTGGCGCAAGCAAGTGCACTCCATAACGATCTCTTTGATAA
TACCACTCCTAAAGAACGTAGGGAAGCGACGATCAGTTTACTAGGGAAAGCGATTGATGAGTTTTTAGAGAGTAAGGAGT
GAGGATATGGGAAAGGGATATTTTAATAAGGCTGTATGTTTAGTGTGTGGTCATCAAGATAGAGTGAATCATCCATCTAA
AAAAGAGTATCAAGAAGTAACGGTTTGTCCGGAATGCAACGGTGCTTTTGTAGATGTGTGGAAGCTAGGAAAGTACAAAC 
GTAATACACAGTCTAATGAAGAACCTTTATTAACAATTACATTAACAGATATAGATGCTAAACCGATAGTTCATTACAAA 
GGTGAACAGATAGATAGAAAGTTACGTGTTACGTTTGATTGGGAATCTCAATCGATTGATAAAATTAATCGGACATACAT
TCATATTGAACATGTACCAGCCGATAACAAACGTTTAAATACCGAGACCATTCAGCATAATCATCCTATTGCAAATAAGG
AACAAGTTTAGATGTTGTCCATATTTGTTAATAGGTAAAAGATAAGTGTTTTATCTGGAAGTTCAAACGTGAATTAAAGA 
AATTAAAAAAGGAATATGAAAAGGAGAGTCACTGAATGAACGGGTTTAATAAAATTGTAAACGATATGCAAAATGAACAA
GTAGGAAATGCTATGCTAGATTTTGCTTTGGCCGCTAAAATGATGTTCGCTGCCTTTACACAGTTTAAAGAAGCTGGATT 
TAACGAAGAGCAGTCATTCGAATTAACACGTGAGATATTAATTGATTCATTAAGTAAGAATCAATAGATCAATGAGGTGA
AAGGGAATGCAAGTATATTGCTCTGAGTGTGATAAAAGTTATGACATGCAGCCGCAAGTAACACAACTCCCTAATCGTAT
TGAGAAGTGTTTCTTTATTTGTCCTCATTGTAATCATGAACATATAGCTGCGTACGTGAATGATAAGATTCGTAAGTATC
AAGCAGATATAGCAAAGTGTCATGAGCGGATTAATAAAAAGAATCTTGCTATCGAAGATGAAATGAAACGATTAAGGAAG
AGGTTTGACAGGAGAAAGTGAGAGGTGAAGCGAGTTTGAAAATGCTATTAACAAAGCATTGGTGTTTAGATAGAAACTGC
GGATTTGAAGAGACTTCTCATAAGGTACGTGATGGTTGGAAATGTCCTGATTGTAATGGACCAATGGCGTTTCAACAGGT 
GAATAAGAAAAAAGaAAGCGCCAAGTGATGGTGCTTTTTATTTTGGAGGAGGATGAAGGATGGAAGGACAGGAGTTAACA
TTGGAAAAGAAAGACAGTATTTATCTTAGACCAAGATACCCTCATAAGATTGACGCAAGTAAAATCAAATCCTTAAAAGA
TGTAATTAAGATTTTAGGATTGATGGATATTCGTTTGGACGACAAGGCGGTCATTGGTCTAGAACACTTGATTGAAAAGG 
AGGAAGAATAAAATGGCCAATAACAAATTAATTATTGAAGTAACTGCGGATACAACTGAGGCATTAGAAGGAATTAAAGA
AGTAACTGAAGCAGCTAATGAATGTGCAGATGCGCTGGACAAATTAGAAAAGATTATGGATAAGTTTACAAATCGAAGTG
ATACAGTGGAACTCTATTGTGAAGGTAAATTGTTATCGAAGTCTACAGTTAATCATACAGCTGATTCaATTCAATGTCGC
ATAATCAAGGGAGAAGAGCTTGGAGGAAGTGAACGCTGATGAAGAAACCGCTTAGACCATGCTGCGAATTTCATTGTTAT 
AATCTCACACGTGAAAGATATTGTGAGGAACATAGATACAAAGAGAAGGAAACGCAGCAGGATAAGAATAGATACTACGA
CCGATTCAAACGGGACAAAGAGAGTACGGCTTTCTATAGGTCAAAGGCATGGGAAAGGTTAAGAGAGCAGGCACTAATGA
GAGACAAAGGGTTGTGCCTACATTGTAAGAACAATAGAAAGATTAAAGTTGCAGATATGGTTGACCATATCATTCCAATC
AAAGTTGATCCAAGTTTAAAACTCAAATTAGAAAATTTACAATCACTTTGTAATCCATGTCACAACAGAAAAACAGCAGA
AGACAAAAAGAAATACGGGTAGGGGCGGGTCGAAAAACATTCAGGGCGGTCTGTCCGTACcgccgcccc 
W phage: Polypeptides Encoded by Polynucleotide Open Reading Frames

Orf1 (SEQ ID NO: 4)

MAGRNKQPLSVIQGKGRSNHITKSEKNRREKQEEALRGHTDKIEAPSYLTAAQKKEFDTI^AELVRLKIFS 
NLDVDSLxARYVDSKDQYIKMVRLLRKTKPSDDFKLYSQMQRS KNLLFN ECRSSASDLGLTITSRLKLVT PE 

VDTSQQKQSEAQKRFGDRI 
Orf2 (SEQ ID NO: 6)

MNWIMERVFAYCEDILNGKINSCKKHRWAIERFIRDYEECQSEDSPFYFDGEIAEDFYWFAKEFKHVEGI 
IAGESVELTDFQLFIiAANIFGFKKKINGARRFRKVFIQLARKNAKSQFIiAIVAAFCTFLGDEKQRAYIAG 
WTRDQSSEWEAVKTGISSSELLEGKWKEAYSTIEIFKNGSVWPLSKEARKTGDGKNPSLGIVDEYHAH 
ETDEIYDVLSSGMVARKEPLMFIITTAGFDLSRPCYREYEYVSDILDPSKNVENDDYFVMICELEKNDDI 
KDESNWIKANPIVATYEEGLEGIRSDLKVALDRPEKM^ 

DFSGATLWI GGDLSMTTDLTSVGWVGMDDEGDFI VGQHS FMPEARLKEKMAI DKVRYDLwAEQGYLTLTP 
GEMVDYTIVESWIENFSKDKEIQEFDYDKWNALHIjAQNLENKGFVCVEIPQRIANLSI PTKNFREKVYEK 
KVKHNGDPVLFWALNNAVVKMDDQENIMI SKKI SKNRI D PAAAVLN AFS RAMYGAS VRFDVS EFAN KDFL 
GKLWN 

Orf3 (SEQ ID NO: 8)

VKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPL 
KI YQEDE YGIQRGTKH YLNNLLRLRPNP YMS SMN FFGSLEAQKNL YGNS YANI EFDRKGKVQALWPI DAS K 
VTVYIDDVGLLNSKTKMWY\A/NTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 
NFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTEL 
TIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVD 
AILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 
EGNEGN 

Orf4 (SEQ ID NO: 10) 

MEKS AKKEMKEI RALPMTI EVREVNEDEGKRT I S G S I K YNN E S AEMRDWW GDT FVEEI AEGAFDES LKVRD 
WGLWS HDT SQVLGNTKS KTLRI ENDKKELRFELDI PNTTVGNDAWELI KRGDVDGVS FGMKVTKDKWS S E 
ERENGKLYKRSILNAELYEISPVAFPAYPTNEVSVRSLDDFKAGEKRVADEFRKRKLQIELELI 

Orf5 (SEQ ID NO: 12) 

MSKELRELIAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 
GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 
EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVT 
KWLGKKS KVTRNVLI LGVI EKLTKQAI KS LDDI KDVLNVKLDPAI S PNAI LLTNQDGFN YLDKLKDKDGKY 
ILQSDPTQKNKKLFAGTNPWWSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 
NTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 

Orf6 (SEQ ID NO: 14) 

MLWLEEAKEWIRVDGDDDPTITMLIKAAELYIYKATGKTFTQTNEDAKLLCLFLVADWYGNRLLVGEKAS 
EKIRTIVQSMILQLQYASEPQEERK 

Orf7 (SEQ ID NO: 16) 

MNPAKLDKRLT FQVKDENAKGPDGDPIDGYKDAFTVV7GSFVYLKGRKYFEAAAANSEVQGETEIRNRDDVS 
ADMKIKYKNVIYDIVSVIPTQDHTLLI1WKRGEMMG 
Orf8 (SEQ ID NO: 18)

MKLTLMINKEKQTFNMPEFI PARLIRQAPELAEI PNNPGPEDMDKMVQEVVKVYDGQFTLDQYWDGVDARK 
FLSTTSDVINAIINETVEAAGGSTESGEEENPNA 

Orf9 (SEQ ID NO: 20)

VINLRPDILQALENDQELVSLLGGKRIYYRKAKKAEEFPRITYFELDNRPDGFADNQEIESEILFQVDVWA 
KSSTTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGVEL 

Orf10 (SEQ ID NO: 22)

MAGE WRI S S TVGVDNLVYAKVLQ D DS S AI K YTDVKKMEGAVKVKLT KKVAS EVMW S DM RK SEIAES DGET 
EVEIEVRGLSLSTKADIEGFPEVKDGVLDEKREGEKPYLAIGFRFLKANDKYRYVWLLKGKIjSQEEEEAET 
KKDKPNFQTTKLKGSFIERDFDDRTKHTADEDEPTFTKLVGDNWFNKVYEKPVTQPPAGK 

Orf11 (SEQ ID NO: 24)

MKLTLMINKEKQTFNMPEFI PARLIRQAPEIAEIPNNPGPEDMDKWQFWKVYDGQFTLDQYWDGVDARK 
FLSTT SDVI N AI INETVEAAGG ST ES GEEEN PNA 

Orf12 (SEQ ID NO: 26)

MDELYLSLLRQGYKHHHIDNEMDIWHYLRLNRKMHENGNENYEGSNSNEIEVPAENII 
Orf13 (SEQ ID NO: 28)

MANEI NNLWRLS LDNVN FRQGI SN S GRAVRTLQNELKSVSTGMGG FANASQQTQAKMNTLS RLI DAQKEK 
VKALRQAYDQNKAKLGEN DAATQRYASQVNKAVADLMRFEN ELKQVNRQAEQKGMDKLNN SLKS LQAEFQS 
I TT GMGG FS NAT EQT RAKVDVLS RMVDKQKEK I RELQQAYN RAKT EEGEAS Q S AQRYAEQI HRAT AELN RF 
ETGLQQSNRELEQQGNRLLNFGNRMETLGNHLQNAGMQIGMVFGGMTYAIGRGLKSAITESMNFEQQMANV 
KAVSGSTGAEMKKLS ELAVNMGETTKYS SVQAGQGI EELI KAGVS LQDI INGGLAGALNLATAGELELGEA 
AEI ASTALNAFKADHLSVADAANI LS GAANASATDVRELKYGLSAS SAVAAGAGHTFKDTATTLAVFAQNG 
LKGSDAGTSLKTMLMRLNPSTKEAYNKMRDLGLITYNAQAGFDFLVKNGIQPASRNVGDIEVALEQYVMKT 
EGVTKWNDKCDTTFRELATSSAFLSSKFYDQQGHIQSLENISGTLHESMKDLTDQQRSMALETLFGSDAVR 
GATILFKEGAKGWEMWDSMSKVTAADVAATKIDTLKGRLTLLDSAFSTMKKTIGDALAPWSVFVAGLQK 
LVDG FNS L P G P VQ KAI AI TGG I VLALTAVATAI GWLAAFGMI AS G I G S LS LALASVGGIAGIAAGAVGFL 
GSALAVLTGPIGLVAAALIGTGWAYKAYQKATEDSIASVDRFATNTEGKVSSSTKKVLGEYFKLSDGIRQ 
KLTEIRLNHEVITEEQSQKLIGQYDKLANTIIEKTNARQQKEIEGLKKFFADSYVLTAEEENKRIEQLNQH 
YEQEKLKTQEKENKI KEILQTAARENRELTTSERI SLQALQDEMDRVAVEHMSKNQMEQKVI LENMRVQAS 
EI SARQAAEVVENSAKARDKVTEDAKKTRDEKIAEAI RQRDENKTI TADEANAI IAEAKRQYDSTVSTARD 
KHKEIVSEAKAQAGEHANQVDWETGQVKSKYQAMKDDVIRmKEMWSDVTNKYEDMKNSASNKVEEIKNTV 
SRKFEEQKKAVTDKMSEIKSSIEDKWNTVEKFFSSINLRSIGKSIIEGLGKGIDDASGGLFSKAAEIASDI 
KKTISGALEINSPSKVMIPVGSAVPEGVGVGMDKGKRFWDAAKNWGTVKKQMGNMPSVFDFGFQTNQYS 
IPQNTFSDFSGYMQPQLSYNNPSMAKTIFPNRPGGEQELNLTVNMTNVLDGKELANGSYTYTTKLQNREQK 
RRAEF 

Orf14 (Tail fiber; this sequence differs from that in γ phage) (SEQ ID NO: 30)

MGKLSFTFNNIRKDYIQMLVGRKR^SWAPVKRRLVRVPHRAGALLLNTETEERR 

QKLKEDLADWLYTEQPAELI FDDELDRTYLSLIDGSVDIJ^EIVNRGKGVTTHVCPMPYKLGKINTHKFTQ 
EWSTETTSYFTNKGSVEAPALIEMTVKKPSTFIJDWFGEYPHNRDYFRIGYPL 

MATPIGWTPVTGQFEEMKGTGSFKSRGGHAiYCEDYGKETGFYGAIAKKNIPGGPLQDFEbffiAWVTIjKSK 
NISEMGRVEVLLLDETSNVISRIlSRtfNDLYATAEITRAH^ 

IiRI ARRGKVW S VYVAKF I DGTEKD GAS LVERWI DETGNPMTERKI AQVMI AI CKWDNHQ PINEMQI DDLK 

IWKVNKVPSNAQPYIFDTGDKIVIDTEKSLVT^ 

YRERYR 

Orf15 (SEQ ID NO: 32)
"DEYIDPLTFLKKIASLFKLEI^

KRVNSSISYEVEAQSlGRIFGLEHELINEGDTIKIKDTGFTPBLYLEARVIAGDESFTDSTnnKYP-prnvD 
^^QNEELRKIYNRILSSLGNKQEMIDQLDRLVQEANETASNAKKESEAAK^ 

qnqatgqfnevkeslqgvsrtisnvenkqgeidkkitkfeqdssgfktsiesltkkoteiSkln?^ 

^KKTI SE VQQ TTNDIl KKKTTEIEEKAGKITEK L TSLETREVNVRNYVIN 

^wiseasaikkglqitsnkafvyqki-padvfkkkkgiascyinvssbtpgtdyprlwrSScte 

KQ^AILKQQE^GWIRISIPFDTTGYTGELKEVRVNIATAD^^ 

^Q^fqsktteieksvdgvkttvtnvqnsqagfekrmsnveqtatglsstvsSwsS 

^,™ ATAI GAKVELI <2 V EDYVAGFKI PELKQTVDKNKQDLLDELANKLATEQFNQKMTLiSnrSS 

gmaaakktewtktqadgqfatdsyvrdmesrlqltekgvsisvkendviaainmskeniklnaaridS 

G^AEWIKAGLLSGCQIRTSNTDNWSLDDQFIRLYERGVARAFLGHYRRSDGAVQPTFILGS^S^ 
EGTLFMSQAGAGWSGAYASIGISNGIVDGAVQKSVYWELQWxIGLSVLNANDYHVFYAGNGNWYFRRGKPGL 
YQTSLWEDNSTDSDLRLPNWIRigSRAAGYTGVIQLKSPXTTQNGWGAVQGNFMTPSLREYKSNIRDISFS 

alekirslkirqfnyknawelyrmreekspndpplttediktyyglivdecdemfvdesgkgihlysyas 

IGIKGLQEVDATVQEQEVEIANLKSQIASQEDRIARLEELLLQQLINKKPEQP 
Orf16 (SEQ ID NO: 34)

MDRIDVLLKAFIAAFGGFCGYFLGGWDATLKILWMWIDYLTGMIAAGYNGELKSKVGFKGIAKKWLFL 
LVGAAAQLDSALGSNSAIREATIFFFMGNELLSLLENAGRMGIPLPQALTNAVEILGGKQKQEEKKGDVQ 

Orf17 (PlyG lysin) (SEQ ID NO: 36)

MEIQKKLVDPSKYGTKCPYTMKPKYITVHNTYNDAPAENEVSYMISNNNEVSFHIAVDDKKAI QGIPLERN 

AWACGDGNGSGNRQSISVEICYSKSGGDRYYKAEDNAVDWRQLMSMYNIPIENVRTHQSWSGKYCPHRML 

AEGRWGAFIQKVKNGNVATTSPTKQNIIQSGAFSPYETPDVMGALTSLKMTADFILQSDGLTYFISKPTSD 
AQLKAMKEYLDRKGWWYEVK 

Orf18 (SEQ ID NO: 38)

MKMYKKLISICIGSTLLLGLTACDSSKQSESSEKTNVKSQPETKKDLTSQDELNKKIKQDAEEVSFVKAJM 

GDQYEKGKRLKATGTVDLLLKSSALPSFVISTNENDGKGMYTIQIVQSGVQTNENEITLKNGLKISKGSI 
VT I YGAY DE KDKT GMP KI S ATVI EQ 

Orf19 (SEQ ID NO: 40)

VRLKCKLRVIFAEREIRQKEFSKLIGISQTTMSSLVNNTTLPTFLTAYKIAKELKLHMEEIWIEEENENV 
Orf20 (SEQ ID NO: 42) 

MRWQYNHLNTTPYLHPSKELCSMYNGSRSRAETESILNHMKNHEVYDRKEYKGYFSLSQVLEEDLYGEEED 
VLNWEILMDCYDWLTRKGIAFREKEEEEQA 

Orf21 (SEQ ID NO: 44) 

MTLAGEAI I IWTATGLSWA1-IKAAEKMGKSVPHWLPRVTLYTTLTGS FLYLLRYVLVLFL 
Orf22 (SEQ ID NO: 46) 

mwklfipyvirslacMHVFLETGIYTLYKRDIRSDFMLELLSVPFAGLIFAIVGERLKGRESDRKKIQVFF 
EVSGIAIRREDKLQYPVFLEQKEDDRSTTYI YRLPVGMPSKIIQKVEDWSEGLSKPVRIDYDNYKLNIRV 
FHRDIPKKWSWSKGLVAEGSWCVPMGQSLEKLIYHDFDKTPHMTLGGLTRMGKTVFLKNWTSLTLAQPEH 
INLYIIDLKGGLEFGPYKNLKQWSIAEKPAEAFMILTNILKKMEEKMEYMKCRHYTNWETNIKERYFII 
VDEGAELCPDKSMKKEQQRLLGACQQMLSHIARIGGALGFRLIFCTQYPTGDTLPRQVKQNSDAKLGFRLP 
TQTASSWIDEAGLETIKSIPGRAIFKTDRLTEIQVPYISNEMMWEHLKGYEVEKHEDAI>IAYANQPSNGDT 
Orf23 (SEQ ID NO: 48)

mrwrnmrmqthmqinrqmailatirklqfatrrhlMSIHEMGGIRNANRILKDLSIYTSKVWNKEHVYYL 
NQSGHKLFGEGKVVHHGKVTHALLRNEAWLNLYCPDDWQVETEIKYI KDNKKKKI I PDVKFRDEDRILHAV 
EI DRTQKKI WDEKLKK YEELTQ I YKQKHN GKVPVI H FFT I T KY REKKLEELAN K YN VFVKVYVT ATT 

Orf24 (SEQ ID NO: 50)

MKFTLGNSLDELGITKNKLSTESQVRYNTI SDLVNGNANAVRFDSLEAIIDALNAIAA.EKGINKI YKIDDV 
IQYIKKS 

Orf25 (SEQ ID NO: 52) 

MAFKASMIASSESKRTALALPFTKSLIVLYLTWDSVDNLFLVIPNSSKEFPSVNFILFSSAALVILYSFY 
NINRN 

Orf26 (SEQ ID NO: 54) 

MLS SAN YT Q Y KKLQ S F RS VEEMN EAI C S FLY KHTHELSE S AI KVL K FLARH S CK I P GV S FL KVGT I A EALN 
ISDRTVRRVLKVLEDFEVVTRHKTIRTEGKLRGGNGHNVYVLLKKYSVTPNVLPKMSQRQDEENLTESKVS 

DTKTDKEAKLSESHPLEELKSELNVKETSARESKEIELEDLDETFTPENVPSQFRDWAPFFKSADKIYKL 

YHRVLIAYKRSKIDKPIEQVINQAIQAFKETVFAEKANKIRSTFEGYFYRIVESKFVMERRKECRGLLFDW 

LNE 

Orf27 (SEQ ID NO: 56) 

LKYAVYVRVSTDRDEQVS SVENQI DICRYWLEKNGYEWDPNAVYFDDGI SGTAWLERHAMQLI LEKARRNE 
LDTWFKSIHRLARDLRDALEIKEILIGHGIRLVTIEENYDSLYEGGNDIKFEMFAMFAAQLPKTISVSVS 
AAMQAKARRGEFIGKPGLGYDVIDKKLVINEKEAEIVREIFDLSYKGYGFKKIANILNDKGTYTKFGQLWS 

HTTVGKI LKNQTYKGNLVLN S YKTVKVDGKKKRVYT P KERLTI IEDHYPT IVS KELWNAVN S DRAS KKKTK 
QDTRNEFRG1VIMFCKHCGEPITAKYSGRYAKGSKKEWVYMKCSNYIRFNRCVNFDPAHYDDIREAIIYGLKQ 
QEKELEIHFNPKMHQKRNDKSTEIKKQIKLLKVKKEKLIDLYVEGLIDKEMFSKRDLNFENEIKEQELALL 

KLTDQNKRNKEEKKIKEAFSMLDEEKDMHEVFKTLIKKITLSKDKYIDIEYTFSL 
Orf28 (SEQ ID NO: 58) 

MRIALYRTHALINVIKYSVNI^KVLLIEMKGVSYLKFHEKIMGMIEDRDDLTATSVACKIGVSKQYMSKF 
KRQGTIGFSQLIJUoAPILSVEGKKAKQTMSDWCIJS^ 

IREYVEVYTILFKYIKNIIKGSEITKEIjKKIGAIKDKVLEILTKIMECYEYYHLKKFNLM 

REI E GERKS FI KEC YN YRI AELFAPI FLQKNNVDLARKY AHFL I HANVCTK TVSDAY Y I LGMSNVLESKEQ 
CLFNI^SYLLSKEIRDADIEQEARYNIJDVAKIYFGVKLDEDADSRLLLYQKNPTCELSIIALQDIIRDRG 

DKDFI.NYFIACSSDEIECLYDLFYQYFYQANYLFSAIVAKELCNRGDKSLLTQSMVNLGNEKQKGVVDIEE 
ISISSLYI INGSNSGIW 

Orf28.1 (not present in γ phage) (SEQ ID NO: 60)

VI I VEFKDRLRQLRRERNLTQHDLGQAI GVTAGSVSKFETGFKPASRETVERAADFLGVPVDYLLGRSDSR 
ELDADMNQKYLHIKNRLEQLPEEHQEIVLQNMLTMMESLEKLKSTSK 

Orf29 (SEQ ID NO: 62) 

MKVI KDETKLKAAFKKS GYKYQELADELEI S C S YCYKLINNHN Y KKKI S YNLAS RMAHVLNAS WDLFEEQ 
VDFF 

Orf30 (SEQ ID NO: 64)

MREHRGERAMS EI YYKGFI I KET YGERNI EEVFKEAYES FYGVEVKWKKELGTKRN SAAS 
Orf31 (SEQ ID NO: 66)
MDQLTVASELRLLGRRKVAGYEFTGXEGGFGEGKKAMLVLDIATIHNQPLKEINRRIWDNRIRFKDGVDIV
DLKSGGFNPPQLLNLGFSWMQIAKSNNIYLLSERGYAKIjLKILEDDKAWEIjYDILVDEV fnmreknqvatd 
PMS I LKLT FEALEGQQQAI EEIKS DVQDLRENT P LFAI ECDEI STAVKRQGVI LLGGKQSNAY RN RGLRGK 
VYRDIYNQLYREFGVKSHKAIKRCHLNVAVKIVEEYTLPIVLSEEISFVl^AQMDFTEM 

Orf32 (SEQ ID NO: 68)

MDQLRVIEGEKVDKPDYVEI YLGAFNINAVNELKKQDEETRSLSKDTYKKAIFYGVRYISI SKNDSLNYDYL_ 
MNRFLLISYLEN LMKVLTPRDFMTIFPIDKNYDGARYEMKDYFFTMNEIKKIGMDTPIGEKIMEFLV7DYQN 
FKDITLFNLASVSILNKLQKMQGKKTLTEEFAERLGIDTYTKHKEKGGKEYITNDRTGEIQEVKKSRPRYL 

KPVQ 

Orf33 (SEQ ID NO: 70)

MALFRKVHTEFWTDVICV/SEDMTPEDKLFIWYLLTNPHTTQLGVYEITPro^tlAFEIGLSIESARALLERFEN 

HHKLI KYN KLTREI AI KNWGKYNLN RGGKP I EDCLKREI DKVKDLS LI KFI LEHT DHAALKRKI NLYAG FD 
DTSHDTLAIRDQEEEKEQKKEQKEEQEEKEKEKEKQKEEEKEPEEEKTRIKSKASLKSDAKSNPIPYKDIL 
DYLNEKANKNFNPKAEGHRKLIRARWNEGYKLEDFKKVIDNKTTQWFGKKSFDGKPLDQFLRPSTLFAQKH 
FDNYliNETVNI SNQQHGDQIVIPGFRGEMPF 

Orf34 (SEQ ID NO: 72) 

VKKIQDSFEKLTKLKFADEQCDKHTFNKHGKEVIKLVRKMIDDAGTVYCPRCMVEEQNSVLFQQANNHYKK 
INRERKKNVLFQHSIIENQSITESRLSTYKTDCQETKENKEKAI KILERI KNGEFLNVYIAGIQGVGKSHL 
AYAMLYELVKHYWISDGEKLNDEHAFKNMKSCLFVEIEKLIRLIQHSFRNIESKYTMDYCISIMVDVDFL 
VIDDLGAESGS^RNGEASDFVHKILYGVTNGRQGANKTTITTSNLSSAQLFQKYDPKLASRLLNGVSKDE 

TIVFKTTTDKRIVNL.DIGF 
Orf35 (SEQ ID NO: 74) 

MKEVKGKNTKLMEEFDVLLRQLLIKSKTDERVKNFLDDLFEMLSDNKLQSDIDFKTALNKLREKHFPKFDK 
GESKND 

Orf36 (SEQ ID NO: 76) 

MTKEKGQAKEVVNVRGMSDDEFI EKYGRLVHHCVWKRYAKKKAS I ERDTGLDI EDLTQFGMI GLI KARDNF 
DLEFGCAFSTYAVPKIIGEI GRAI RDNQKI KVQRTVYGVKGKI LWQQLADKEPEEIADI LDES VSLVKTAL 
EYQPSTDSLNKVVYASGANEELTLERMIEDTKTEDIEETTINRAVIREFKAALPPKEYIVLDMRLQNMTQQ 

N IAN QMG Y S QVQ I S RI LAKI NQRAAQ FG KEGGLQ D 
Orf37 (SEQ ID NO: 78) 

LSVTKGVCIDVDHSDLLHEKVEYFLFPAKPSHYYVSRFNRKGAHFGCYQAERFQITEKEVWTPEPQPNLPE 
LNT S LFYRAQLI WRKKG YKDKPiKD YI VQP RGKH CY FWHDRERKKFC GCFP LHWFT DFVP VQSHH I EEKTR 
EEVKLLQRPDGQLAFF 

Orf38 (SEQ ID NO: 80)

MDIKKIjFAMQN I LDKRVLES KN L S RGEVFEFRI LAFLDELGECMKEWRVFKFWS DDRKPRTS I PT GEI I VL 
DDGYEVEVYKNPLLEEYVDGLHFAIGLCIDLKTEINFPASMRCETVTEQFFELYHLAIRLKEEPTAFRADV 

LLSHYLGLGELLCFSLEEIGHEYIEKNKINHERQSNGY 

Orf39 (spore surface antigen; replaces 39 in γ phage) (SEQ ID NO: 82)

MDCFKKGKFIPFPCALPIPEAGPTGPTGPPGSAGGSTGPTGPTGPQGIiQGIQGVQGNPGTTGPQGIQGIQ 
GIPGVSGPIGPIGPTGIQGVQGIQGFPGXPGPMGPIGLTGPTGIQGIQGIQGVQGIQGIQGDVGPTGPQG 
IPGIPGLTGPTGSQGVTGVTGPSGGPPGPTGATGPTGPAGGPPGPTGPTGPAGGPTGLTGPTGPTGPTGI 
QGIQGVQGTQGIPGPTGPQGIQGVQGLQGIPGIPGSMGPTGLTGPTGLQGIQGIQGNPGPTGPFGPTGPT 

GliQGIQGLQGIQGIPGSNRTSRNPRSNRTC 
Orf40 (replaces 40 in γ phage) (SEQ ID NO: 84)

LIAHFPQKLFFFGGTNSGFQRIAGSPGADSQDIPYVLGGAGSWGLSAS^ 

IlAAPGPGQVISTIILTTTAVISGTIILTINPSDIGAQPVRVFNPNIiVIAPATVAWSSTIPGDIVARGDA^lS 
LFITPGI TQNAVY TVFLHTGN 

Orf41 (replaces 41 in γ phage) (SEQ ID NO: 86)

MQHIPRYYYQSQSPMDSIWNNNNWIYAWNPYYYNYNNNAWNRNRNPY^^ 
ITOTYRTAIWTGKNLQVTI^SIIWGDDIGI^VHPT^ 

PAGKViTHNVINTGNTPI^IYAIYAPPEHPYGTVHETKAIAMSTEANRYYY 
Orf42 (SEQ ID NO: 88) 

MIVKATI KLELDDSQKNWVSYVREQGGEEAVFHYLEEEVQKKIELADFVEMKYKWK 
Orf43 (SEQ ID NO: 90) 

MDMSLVGNLKELQEKAIDEKVLEFAEEMEIVITKSAASGYSGHRYKIHNENPNRHMT^ICSKI FIEKLQELLD 
GVKVEFKEEEKKNILGGSYYEHYI RFKWND 

Orf44 (SEQ ID NO: 92)

MTN FLLKI LFWRKGVERMKTFNVT FTELKI YEAVI EAESAEKI I DVI KHLKRTEDDLVDKGVI I NEVSEIN 
VSKEQKFE 

Orf45 (SEQ ID NO: 94) 

VNHHLFNWLRDYQKLEEDIAYLEYNLDKTKAELRRWVSGDLREVRLTAESEGAK^^NRI EAI EYELAHKMN 
DMYKLKKLISKFRGLENQILKLKYVDGMTLEEIAEAVNYSSSHIKKKHAELVRLIKFVEREGVI 

Orf46 (SEQ ID NO: 96) 

MDVQELSRRLEN LEHKVLQVETKADVLNRTAI QKGDKI KVVYPHLGI QGE YLVEKI DWGVLELVAEETMKK 
IQE 



Orf47 (SEQ ID NO: 98)

LKKLS KQEIiAAVMTHC I STLGEQI VN EHIN PQKLAQAS ALHN DLFDNTT P KERREATI S LLGKAI DEFLES 
KE 



Orf48 (SEQ ID NO: 100)

HGKGYFNKAVCLVCGHQDRVNHPSKKEYQEVTVCPECNGAFVDVWKLGKYKRhlTQSNEEPLLTITLTDIDA 
KPIVHYKGEQIDRKLRVTFDWESQSIDKIWRTYIHIEHVPADNKRLNTETIQHNHPIANKEQV 

Orf49 (SEQ ID NO: 102) 

MN G FN K I VN DMQM EQVGNAMLD FALAAKMMFAAFTQ FKEAG FN EEQ S FELT REI LI D SL S KNQ 



Orf50 (SEQ ID NO: 104)

MQVYCS ECDKS YDMQPQVTQLPN RI EKCFFI CPHCNHEHI AAYVNDKI RKYQADIAKCHERINKKN1AI ED 
EMKRLRKRFDRRK 



Orf51 (SEQ ID NO: 106)

HEGQELTLEKKDSIYLRPRYPHKIDA.SKIKSLKDVIKILGLMDIRLDDKAVIGLEHLIEKEEE 
Orf52 (SEQ ID NO: 108)

LKRRKNKMANMKLI I EVTADTTE1ALEGIKEVTEAANECADALDKLEKIMDKFTNRSDTVELYCEGKLLSKS 
TVNHTADSIQCRIIKGEELGGSER 

Orf53 (SEQ ID NO: 109) 

MKKPLRPCCEFHCYNLTRERYCEEHRYKEKETQQDKNRYYDRFKRDKESTAFYRSKAWERLREQALMRDKG 

LCLHCKNNRKIKVADMVDHIIPI1CVDPSL 

KIiKLENLQSLCNPCHNRKTAEDKKKYG 
Figure 3

Figure 4

Figure 5

Figure 7